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HYPOXIA-REGULATED GENES 

™s tavendon « ^ and Se» P™d»cU ^ - taP>i-=^ '» 

States. 

AH pubUcado., pa»„. and pa«n. appUcado^ c,»d hc«», - «»' ^ 

5 reference. 

0«r d.. 30^ — •■ has allowed access . U.ge po«io.s of 

r^Lof,^..d»tarcexprcss.dind,e.od,.AsMsinf6n„ado„iBC«^ 

,Or:o.csI«aa,a,.«..es^of n«c.«^o.*^^ 
«a«io„of»ch„i,u«fbrco«badngd»»discas«wmb.c.nsid«ab,yfa.bu«d. 

one pardOar physiologic^ con^do. d.. has considerable «levance .0 ^ "nd ^ 
disease is d. ceDular «sponse .0 hypoxia. The «n •hypoxia" .s .n,e^ » » 
or ^uced oxygen »nsion. ss » -he nonn. ,*ys.o.og,cal 

15 envi«nmentforapartic-toorBanism,whichi,Bn„cd-non„ox.a". 

,„ a variety of h^nan diseases, ceds a. exp^ed » »«iidons o, low oxygen iensio. usu-l, 
!.„Ifpoo.oxygensnpp.,»d..se.se.a^.^ins»n...sneo„^ 

significan. .^^^ role in h«h apop»sis and in angiogenes« a>oncl= ' ^ 

cL, Res 69135.m; b™ « al, 1996, Physiol Kev. 76:839-885; Dor « d, 1997, T«nds 

rr Z 7-289-a94- CanneUet e, * 1998, Namre 394:«5-490). Apop»>sis (see 
20 Cardiovasc. Med., Liw-/:^, v.miut,nw 

,„6.Sci.An«io.n,80-87fo,«view)«.dg»wd.anes.occ„,«hence,lgrow* 

Tviti are .eauced dne u. oxygen ^va.on. An^ogen^ O-C. blo<. vessel grow., 
sill Jon), U sdmnlale. when hy^xygena^ ^s secret d»— 
proliferadon and mi^don of endoMia. cdls in an .»n,t ,0 oxygen l»n«os»s. 
25 (for review see Hanahana* 1996. Cell, 86353-364). 

^baendc *sease p^ologies involve a decrease in d« Mood supply » a hoddy «g»^» 
or body ,»r. generally cansed by cons»icdon or obsrmcdon of me blood vessels. Fc exam^e, 
solid nnnours rypicall, have a ds^gamsed blood supply, leading » 
disea. Condons involving hypoxia include ad^rosclercsis, reb-K^,, 
30 faita^ myocardial irf^on, diseases involving inf^do. of d,e always (su^ as cysbc 
Li s) Id s«*e. Th^fdre, apop»sis and angiogeuesis as in<hK». by *e « 
Ldlarea.soconsider«..o..involvedh,d,esedi^«a.=s^.Usg.erall,^^^ 
,ba. ^Ktosranding dre mechanism b, which «lls «pond » d»se dueases may d» ke, .o 
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tedise«pa*ology«.d*mrde™ttotoe«tr«lni».. 

undergo apopra ..^uted aenes provide potent therapeutic targets. It 

observed that apoptosis and angiogenesis-related genes proviaep 

observea uidi p f . , , , ^tiral role in the selection of mutations that 

has also been observed that hypoxia plays a cntical role m th 

contribute to more severe mmourigenic phenotypes (Graeber et oL, 1996 

10 379(6560):88-91). 

u ■ «vev cell tvpe of the immune system, which has been shown to play a 

^^'z.-irrr';."^- - - — 

15 athaoscleiosB, ibemmtoid aitans (RA), pom" r 

„.ri„h«3l artful ctaoric P>"«»>™^ ^ / 

the macrophage is accepted as being a key cell type P P 

rheumatoid arthritis (Kimie RW er at, 2000 Arthnn. i?e.««n:ft, 2.189 202). this 
rheumaioioarum p a ,t least oartlv attributable to its exposure to 

the aberrant functions of macrophages m RA are at least parUy atm 

hypoxia. . 
u „ • K!«»l 1999 J Le»tocyte Biol. 66: 889-900). Tht macrophage. IS also a key 

^0 cancer, and determining prognosis (Leek RD i:^^". 

Lr. a complete understanding of die transcripdonal response of i^mary hum^ 
. 1H therefore be of great utility in identifying novel therapeutic strategies for 

macrophages, would therefore be oi grc«»i / viral-based 

afceme^ioned diseases. « •^HTlt^ 

„«,exj«ssio. of ke, «a^pdon fec»s which -edia* -he h,pox.c response .« -so 
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required. 

Early in the history of this field it was discovered that a transcription factor, HIF-lalpha, is 
ubiquitously present in cells and is responsible for the induction of a number of genes m 
response to hypoxia. This protein is considered a master regulator of oxygen homeostasis (see. 
5 for example, Semenza, (1998) Cun. Op. Genetics and Dev. 8:588-594). Although HlF-lais 
well icnown to mediate responses to hypoxia, other transcription factors are also known or 
suspected to be involved. Tl^ese include a protein called endothelial PAS domain protem 1 
(EPASl)orHIF-2cx, which shares 48% sequence identity with HIF-laCTian eta/.. Ger^De.. 
\W1 Jan l;ll(l):72-82.). Evidence suggests that EPASl is especially important in mediatmg 

in .«+air cell tvues and it is clearly detectable in human macrophages, 
10 the hypoxia-response m certam ceil types, ana u » wcaii, 

suggesting a role in this cell type (Griffiths et aU 2000, Gem Ther., 7(3):255-62). 

Anumber of genes regdated by transcription factors such as HIF-lahave been identified that 

are implicated in the physiological response to hypoxia. However, there remains a great need 
for the identification of other genes that are impUcated in this condition, in order to develop a 

1 5 spectrum of diagnostic and therapeutic a^nts for use as tools in combating diseases in which 
hypoxia plays a role. Such genes and the proteins that they encode are candidate targets for 
antagonist or agonist agents that modulate human disease states. Furthermore, the idenufied 
genes are associated with regulatory elements that provide alternative and additional candidate 
targets for exploitation for the delivery of gene products in a cell-specific fashion. Any genes 

20 and regulatory elements identified as having a role in hypoxia may be used directly m 
therapeutic appUcations via gene therapy, via recombinant protein methods or via chemical 
mimetics or as targets for the development of agonists and antagonists such as antibodies, 
small chemical molecules, peptides, regulatory nucleic acids. 

According to Ae invention, there are provided novel genes and proteins, that have been 
25 functionally annotated for the first time. Some of these sequences are only identified as 
"hypothetical proteins" in the public databases. Each and every one of these sequences forms 
an embodmient of this aspect of the invention. The invention also includes proteins whose 
amino acid sequences are encoded by a nucleic add sequence recited in vanous ESTs 
deposited in the public databases or encoded by a gene identified from such an EST. At 
30 present, all of these EST sequences are functionally unannotated in the public databases. Each 

and every one of these sequences fomis an embodiment of this aspect of the invention. 

One embodiment of the invention thus provides a substantially purified polypeptide, which 

polypeptide: 
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i, con-pm.. or oo^s-s of to a^ino acid sequence .s in cc of SEQ m 
Nos: 3,5,7,11, 13, 19,21, 23 or Z'; 

« cc^» or coBsis. of a. «-no acid «,ue„ce encoded ^ a nw^eic add 
j,J«ci«di.a.,.ne.fSEQn>Nos: 2,4,6,8, 10. 12. M. 18,20 22,X26 
; a 30. 32. 34 or 36, or encoded b, a ge« idcndfied from an EST .eCed m an, o„e 

oftheseSEQIDNos; 

Ui) i. a fragn^,. of . polypeptide according M) or u), provided *a. said f«gmen. 
^n.abiologioal.edvi.ypos«»»dby*eMllenB*poln«pddeoti)or„),orh.s 

^ .aiiemc deBmin«.t in common with fte polypeptide of i) or n); or 
10 j,)isafnBCtionaleqmvalentof.polyp^4eofi),ii)or(in). 

^ polypeptide sequel recited in SEQ E> Nos: 3, 5, 7, 11, 13. 19. 21. 23 «.d 27w^. 
X ^^present disclo^. unanno,««. in ti« htera^re and p.* ^ data^ 
Lrdin gly -til «>w. no biological fmtcSon ba. been «.Hb„.ed to 
ZJ^^ of tbeae acuenee. ia generally Ubelled in .be da.ab.sea aa . ^ypotbeb^ 
,5 !Z- A novel n^tbod tanned "Snwtomic.', described herein and n, oo^ndrng 
" Cu„nal^..ppUeatio.K.— 8.-now..ncid«eda...og^^on^ 
^ p^ypeptide. in « *e, bav. been fonnd to be diff«e„baDy .^iated nnde, 
physiological conditions of hypoxia. 

^ *e purposes of tbis dconnent. tbe »n -bypoxia- ,b»dd be tal,«, » n«» - 
,0 Tvi^nTof oxygen ten.™ sncb .bat oxyg» -tent is between about 5% a^l* 
(V,v). b, ntos. case, bypoxic tissue wH, bave an oxygen content tbat is less .ban or^ » 
ab..2%.The.enn-norn»xia-sbouldbe.aken»n«ncondibo„seon.pr.smg.no,n»ll^^^ 

riygen br *e «,vironmen. con«med. Ko,nK>» tissue typically bas a. oxygen content 

above about 5%. 

« Tb.se discoveries aUow the deveiopn^m of re^sucb as smaU drug nK.^^-^ 
Ltbeacbvi.yof.beseHypeptides.soal,owinsdiseasesar.pb^olog,c^c^o.^ 

are caused b, bypoxia. or in wbicb bypoxia ba. been '^^^ .'^J^^'^^ 
^veHes also How tbe development of diagnosdo agents ti»t ™ surtabl. I. 

0, bypoxU in biolo^C ti^ and. ti™^ ti» 
30 p„,,,IpUsms(such.sSNPs)witbinge„es.«xli.gfortbepro.eins,mpUcatedi«^^ 

conditions in which hypoxia is impUcated. 
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J^mo^ acdvi^of p-ypepaaes whose sequences «U.«dtaSEQIDNos: 3, 5^7^^ 

^i„epad.has^fouM»Uind,«d™<.ercon«o.ofh,po^.By"hypo«a^^^ 
r«.^......^^.««paae.e^auMs.«.eve.«^ac.U.exp«ed.o.^^ 

5 . 000^ .0 i. ^pcessic ,e«. ^ condido. o, ncnnox.. T^^ 

,1 . ceU . expend » h,^xia conditions as co«pa«d » — n level 

under conditions of normoxia. 

. addi.0. polypepades whose an^o a.d sequence is enoo^ J, a ™..«e acid 
,0 ^».i.anyoneotSEQn,Nos.a,4,6,B,10,,.U lS.20,aX^^30^^^^^^^^ 

36 or encoded by a gene idendfled iiom an EST «cittd m an, on. of d«se SEQ ID Nos h. 
been found to be hypoxia-regulated. 

Tie polypepdd. sconces wbose anuno acid sequence is encoded by a nucleic acid aequ^ce 
^:^«BQIDL2.4,MaMZU.».20.2.24,2a,28,30.32.34and3..<.u*ose 

,5 ltLdaLce.encodedbya,eneide«ieed*o„«,EST«ci»dMan,one„f..^^ 
ZroNosv,ere.lso,pnorU,d»p«««asclosu.e,unanno«edind«h«,a»».ndp^^^^ 

polypeptide sequences. 

aequencea in d^ roop M inu> a nun^e. of differ. ca»gones. The « of ^ 
20 cONrclo^,forwhich.p««i.se,uencehas„o.bee„P«dic»a^d.e^-A-^ 
^ is exp^ssed scp^nce »g (ESD sequen^s d,« are .ep^sen^d u. *e u™0e« 

^..p«bi*.nih.gov/UniGeneO, which con«i. »odcs. o. we* hon>o,o« » 
known p»l When ^nslared, ESTs a« sin^^pass s«*.cncc files of d« 5' .g-on of an 
r .„„™,e as accessed via a force^onedcDNA library. EST sequences 

organism's expressed genome as accessea ™ • i" 
25Jd»b.sho,.«»ia»ag««r..n.>e"" enor^. «nrGe.« (- 
h.p./www.ncbi.in.nih.gov/WeWNews.«/»^.h«n. for review) is an »^ 
Janunnaacau, parfidoning d»se EST sequences in« a non— . ^ of 
Cus^rs. Each UniGene clua^r considered » rcp^en. a u.^ 
gene, as weU as relied info^don such as a» Uss» wes in which d» gene tas been 
30 xpresseda„dn»plocadon.A«c.«^ofhi,sidendiiedb,d>e,.e«,odsdescr,hedh^ 

involves EST sequences d... are co«ai«d in Unigen. clus«rs. W which are no, ann^ 
and exhibit .0 homologies ,0 proteins contained in dte public database. The fcurd. and final 
category encompasses singleton EST sequence entries d»t a« no, i»c«po«ted as ««nes .n 
d,eUnipn»dat.b.seandd»tonlyappearasstaSeent,iesin«»>publlcd.n*«es. 
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A biological function has now been attributed to the polypeptides that are encoded by genes 
incorporating cDNA and EST sequences that fall into the four.categories set out above, in that 
these sequences have been found to be diferentially regulated under physiological conditions 
of hypoxia. Such polypeptides may have an amino acid sequence that is encoded by a nucleic 
5 acid sequence recited in any one of SEQ ID Nos: 2, 4, 6. 8. 10, 12, 14. 18. 20. 22, 24, 26, 28. 
30 32 34 and 36. However, the EST sequences in particular may not be part of flie actiial 
coding sequence for a gene, often representing regulatory regions of tiie gene, or regions tiiat 
are transcribed, but not translated into polypeptide. Accordingly, tins aspect of tiie invention 
also includes polypeptides tiiat are encoded by a gene identified &om an EST recited in any 
10 one of SEQ ID Nos: 2. 4. 6. 8, 10. 12, 14, 18. 20. 22, 24. 26. 28, 30, 32, 34 and 36. 

Polypeptides of fliis aspect of tire invention are intended to include fragments of polypeptides 
according to i) or u) as defined above, provided tiiat tire fragment retains a biological activity 
that is possessed by tiie fiill lengtii polypeptide of i) or ii), or has an antigenic detemunant m 
common witii tire polypeptide of i) or ii). As used herein, tiie term "fragment" refers to a 
15 polypeptide having an amino acid sequence tiiat is tiie same as part, but not all. of an amino 
acid sequence as recited in any one of SEQ ID Nos: 3. 5. 7. 11. 13. 19. 21. 23 and 27. an amino 
acid sequence fliat is encoded by a nucleic acid sequence recited in any one of SEQ ID Nos. 2, 
4, 6, 8, 10, 12. 14, 18, 20, 22, 24, 26, 28. 30, 32, 34 and 36, or an amino add sequence diat is 
elcld^i by a gene tiiat is linked to a nucleic acid sequence recited in any one of tiiese SEQ ID 
20 Nos. The fragments should comprise at least n consecutive amino acids from tiie sequence and. 
depending on tiie particular sequence, n preferably is 7 or more (for example. 8. 10, 12. 14, 16. 
18, 2b or more). Small fragments may form an antigenic determinant. 
Such fragments may be isolated fragments, fliat are not part of or fiised to otiier amino acids or 
polypeptides, or diey may be comprised witiiin a larger polypeptide, of which tiiey fomi a part 
25 or region. When comprised witiiin a larger polypeptide, a fragment of tiie invention most 
preferably forms a single continuous region, For instance, certain preferred embodiments relate 
to a fragment having a pre - and/or pro- polypeptide region fused to tire amino terminus of tiie 
fragment and/or an additional region fused to tiie carboxyl terminus of tiie fragment. However, 
several fiagments may be comprised witiiin a single larger polypeptide. 
30 The polypeptides of tiie present invention or tiieir immunogenic fragments (comprising at least 
one antigenic determinant) can be used to generate Ugands. such as polyclonal or monoclonal 
antibodies, tiiat are immunospecific for tiie polypeptides. Such antibodies may be employed to 
isolate or to identify clones tiiat express a polypeptide according to tiie invention or, for 
exanqjle. to purify tiie polypeptide by affinity chromatography. Such antibodies may also be 
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employed as diagnostic or therapeutic aids, amongst other applications, as will be apparent to 
the skilled reader. 

The term "immunospecific" means that an antibody has substantially greater affinity for a 
polypeptide according to the invention tiian their affinity for related polypeptides. As used 
5 herein, the term "antibody" is intended to include intact molecules as weU as fragments 
thereof, such as Fab, m'h and scFv, which are capable of binding to the antigenic 
determinant in question. 

The invention also includes functional equivalents of a polypeptide of i), ii) or (iu) as recited 
above. A functionally-equivalent polypeptide according to tiiis aspect of tiie invention may be 
10 a polypeptide that is homologous to a polypeptide whose sequence is explicitiy recited herein. 
Two polypeptides are said to be "homologous" if the sequence of one of tiie polypeptides has a 
high enough degree of identity or similarity to the sequence of tiie otiier polypeptide. 'Identity" 
indicates tiiat at any particular position in tiie aligned sequences, the amino acid residue is 
identical between tiie sequences. "Similarity" indicates tiiat, at any particular position in tiie 
15 aligned sequences, tiie amino acid residue is of a similar typetetween tiie sequences. Degrees 
of identity and similarity can be readily calculated according to metiiods known in tiie art (see, 
for example, Computational Molecular Biology, Usk, A.M., ed., Oxford University Press, 
New York. 1988; Biocomputing. Informatics and Genome Projects, Sraitii, D.W., ed.. 
Academic Press, New Yoric, 1993). 
20 Typically, greater ttian 50% identity between two polypeptides is considered to be an 
indication of functional equivalence, provided tiiat eitiier tiie biological activity of the 
polypeptide is retained or tiie polypeptides possess an antigenic determinant in common. 
Preferably, a functionally equivalent polypeptide according to tius aspect of tiie invention 
exhibits a degree of sequence identity witti a polypeptide sequence expUcitly identified herein. 
25 or witii a fagment tiiereof, of greater tiian 50%. More preferred polypeptides have degrees of 
identity of greater tiian 60%, 70%. 80%, 90%. 95%, 98% or 99%. respectively. 
Functionally-equivalent polypeptides according to tiie invention are tiierefore intended to 
include natural biological variants (for exanqile. allelic variants or geographical variations 
witiiin tiie species from which tiie polypeptides are derived) and mutants (such as mutants 
30 containing amino acid substitutions, insertions or deletions) of tiie polypeptides whose sequences 
are explicitiy recited herein. Such mutants may include polypeptides in which one or more of 
the amino acid residues are substituted witii a conserved or non-conserved amino acid residue 
(preferably a conserved amino acid .residue) and such substituted amino acid residue may or 
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may not be one encoded by the genetic code. Typical such substitutions are among Ala. Val, 

Leu and He; among Ser and nr, among the acidic residues Asp and Glu; among Asn and Gin; 

among the basic lesidues Lys and Arg; or among the aromatic residues Phe and Tyr. 

Particularly preferred are variants in which several, i.e. between 5 and 10, 1 and 5. 1 and 3, 1 
5 and 2 or just 1 amino acids are substituted, deleted or added in any combination. Especially 

prcfcned are silent substitutions, additions and deletions, which do not alter the properties and 

activities of the protein. Also especially preferred in this regard are conservative substituUons. 

"Mutanr polypeptides also include polypeptides in which one or more of the ammo acrd 

residues includes a substituent group, 
10 AcooriiDB u> a ted»r .spec, of d,e in«ntion. .to is provided a pmified and iaol.«d nucleic 
.cid molecule m • polypepide according to an, one of d« aspecB of invention 

diseased above. Such a nncleic add molecule may consist of a nucleic acid sequence as 
.eci-ed in any one of SEQ ID Nos. 2, 4, 6. 8. 10. 12. 14. 18, 20, 22, 24, 26, 28, 30, 32, 34 and 
36 or fcrm a r«iundant ecr^valent or ftagment drereof. TOs .spec of dK= inventa also 
,5 in^todes a purified nucleic acid molecule which hydridi»s under hi^ srtngency condtaons 
with a nudeic add molecule as described above. 

According to a ftrther aspect of the invention, there is provided an expre^ion vector that 
contains a purified and isolated nucleic acid molecule according to the aspects of the mventron 
described above. The invention also incorporates a delivery vehicle, such as a hposome. 
20 comprising a nucleic acid according to the above^escribed aspects of the invention. 

In a further aspect, the invention provides a host cell transformed with a vector of the above- 
described aspect of the invention. 

In a still further aspect, the invention provides a ligand that binds specifically to a polypeptide 
according to the above-described aspects of the invention. The ligand may be an antagomst 

25 Ugand that inhibits the biological activity of (he polypeptide, or may be an agonist ligand that 
activates the hypoxia-induced activity of the polypeptide to augment or potentiate a hypoxia- 
induced activity. The tem. "Ugand", as used herein, is used broadly and is intended to 
encompass natural or modified substrates, enzymes, receptors, small organic molecules such as 
small natural or synthetic organic molecules of up to 2000Da, preferably 800Da or less. 

30 peptidomimetics, inorganic molecules, peptides, polypeptides, antibodies, structural or 
functional mimetics of the aforementioned and all other types of chemical entities that act m 
the manner specified. 
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In a still further aspect of the invention, there is provided a ligand which binds specifically to. 
and which preferably inhibits the hypoxia-induced activity of. a polypeptide according to any 
one of the above-described aspects of the invention. Such a ligand may. for example, be an 
antibody that is immunospecific for the polypeptide in question. 

5 According to a further aspect, the invention provides a polypeptide, a nucleic acid molecule, 
vector or Ugand as described above, for use in therapy or diagnosis of a disease or abnormal 
physiological condition. Preferably, the disease or abnormal physiological condition that is 
affected by hypoxia; examples of such diseases include cancer, ischaemic conditions (such as 
stroke, coronary arterial disease, peripheral arterial disease), reperfusion injury, retinopathy, 

10 neonatal stress, preeclampsia, atherosclerosis, inflammatory conditions (including rheumatoid 
arthritis), diseases involving infections of the airways (such as cystic fibrosis) and wound 
healing. The undesired cellular process involved in said diseases might include, but is not 
restricted to; tumorigenesis, angiogenesis, apoptosis, inflammation or erythropoiesis. The 
undesired biochemical processes involved in said cellular processes might include, but is not 

15 restricted to, glycolysis, gluconeogenesis, glucose transportation, catecholamine synthesis, iron 
transport or nitric oxide synthesis. 

According to the invention, a protein known to be implicated in the biological response to 
hypoxia has now been found to be regulated by HIFla. Certain fimctions of this protein are 
known, meaning that these functions have been annotated in the public databases. The 
20 sequence of this protein is presented in SEQ ID No: 37; the encoding gene sequence is 
presented as SEQ ID No: 38. 

According to a fiirther aspect of the invention, there is provided a substantially purified 
polypeptide, which polypeptide: 

i) comprises the amino acid sequence as recited in any one of SEQ ID Nos: 3, 5, 7, 
25 11. 13. 19, 21, 23, 27 or 37; 

ii) has an amino acid sequence encoded by a nucleic add sequence recited in any one 
of SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14, 18. 20, 22, 24, 26, 28, 30. 32, 34. 36 or 38. or 
encoded by a gene identified from an EST recited in any one of these SEQ ID Nos; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment 
30 retains a biological activity possessed by the fiiU length polypeptide of i) or u). or has 

an antigenic determinant in common wilh the polypeptide of i) or ii); or 

iv) is a functional equivalent of a polypeptide of i), ii) or (iii); 
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for use in the diagnosis or therapy of tamourigenesis, angiogenesis, apoptosis, the biological 
response to hypoxia conditions, or a hypoxic-associated pathology. 
The invention also provides a purified and isolated nucleic acid molecule that encodes a 
polypeptide accoixling to this aspect of the invention, for use in the diagnosis or therapy of 

5 tumourigenesis. angiogenesis, apoptosis. the biological response to hypoxia conditions, or a 
hypoxic-associated pathology. The sequences of these molecules are provided in SEQ ID Nos.: 
2, 4, 6, 8, 10. 12, 14, 18, 20, 22. 24. 26, 28, 30, 32, 34. 36 and 38. As described above for the 
EST nucleic add sequences annotated herein, this aspect of the invention includes redundant 
equivalents and fragments of the sequences explicitly recited in SEQ ID Nos.: 2, 4. 6, 8, 10, 

10 12, 14, 18, 20, 22, 24, 26, 28, 30, 32. 34. 36 and 38. and purified nucleic acid molecules which 
hybridize under high stringency conditions with, such nucleic acid molecules, and vectors 
containing such nucleic acid molecules for use in the diagnosis or therapy of tumourigenesis, 
angiogenesis, apoptosis, the biological response to hypoxia conditions, or a hypoxic-associated 
pathology. 

15 This aspect of the invention also includes ligands which bind specifically to, and which 
preferably inhibit the hypoxia-induced activity of, a polypeptide Usted in SEQ ID Nos.: 3, 5. 7, 
11, 13, 19, 21, 23, 27 or 37, or encoded by a nucleic acid sequence recited in any one of SEQ 
ID Nos: 2,' 4, 6, 8, 10, 12. 14, 18. 20, 22, 24. 26, 28. 30, 32. 34, 36 or 38. or encoded by a gene 
identified from an EST recited in any one of these SEQ ID Nos. for use in the diagnosis or 

20 ther^y of tumourigenesis, angiogenesis, apoptosis, the biological response to hypoxia 
conditions, or a hypoxic-associated pathology. 

The invention also provides a pharmaceutical composition suitable for modulating hypoxia 
and/or ischaemia. comprising a therapeutically-effective amount of a a polypeptide, a nucleic 
acid molecule, vector or Ugand as described above, in conjunction witii a pharmaceutically- 
25 accq)table carrier. 

The invention also provides a vaccine composition comprising a polypeptide, or a nucleic acid 
molecule as described above. 

The invention also provides a method of treating a disease in a patient in need of such 
treabnent by administering to a patient a therapeutically effective amount of a polypeptide, a 
30 nucleic acid molecule, vector, ligand or pharmaceutical composition as described above. For 
diseases in which the expression of flie namral gene or the activity of the polypeptide is lower 
in a diseased patient when compared to tiie level of expression or activity in a healthy patient, 
tiie polypeptide, nucleic add molecule, Ugand, compound or composition administered to the 
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p^.„t be an agonist For dis«s» to which to cxpessio. of fte nannal gene or 
acdvi., of polypepdde is h,gh« to . dis««d pati«« «hen con^ » *c level of 
cxpessio. or activity in a healthy padent, the polypepUdc, nnddc acid molecule, vector, 
ligand, coniponnd or composition adnums..«d to the patient is ». antagonist By the tenn 

5 "agonist" is meant herein, any polypeptide, pepdde, synd,etic molecnle or org^tic molecnle 
that ftorfons as an activator, by inceastag the efetiv. biological »»ivity of a polypepttde. 
for example, by toceastag ^ expression or enzymatic acdvity. By the tenn "amagonist" « 
mean. be»in. an, polypeptide, peptide, synthetic molecnle or organic molecde that hmCons 
as an inhibitor, by deceasing the effective biological activity of the gene prodnct for exan^le. 

10 byinMbittaggeneexptessionofanen2jmeoraphaimac<Jogical.eceptor. 

m inv«,flon also provides a poln>eptide, nuclae acid molecule, v^Mor, ligand or 
phatmaceutical eompositio. according to any one of the .bove<tescrib«i »pects of the 
mvention. fo, use to U,e manufacnue of a medicament for the treatment of a hypoxia-tegulated 

condition. 

15 The invention also provides a method of monitoring the therapeutic treatment of disease or 
physiological condition in a patient, comprising monitoring over a period of tmie the level of 
expres^on or activity of a polypeptide, nucleic acid molecule, vector or ligand accordmg to 
any one of the above-described aspects of the invention in tissue from said patient, wheiem 
altering ^d level of expression or activity over the period of time towards a control level is 
20 indicative of regression of said disease or physiological condition. 

The invention also provides a method of providing a hypoxia regulating gene, an apoptotic or 
an angiogenesis regulating gene by administering dim^tly to a patient in need of such therapy 
an expressible vector comprising expression control sequences operably linked to one or more 
of the nucleic add molecules as described above. 
25 THe invention also provides a metiiod of diagnosing a hypoxia-regulated condition in a patient, 
comprising assessing tiae level of expression of a natural gene encoding a polypeptide 
according to any one of tiie aspects of tiie invention described above in tissue from said patient 
and comparing said level of expression or activity to a control level, wherein a level tiiat is 
different to said control level is indicative of die hypoxia-related condition. 
30 Such a method of diagnods may be carried out in vitro. One example of a suitable method 
comprises tiie steps of: (a) contacting a ligand as described above with a biological sample 
under conditions suitable for the formation of a Ugand-polypeptide complex; and (b) detecting 
said complex. 
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A further example of a suitable method may comprises the steps of. a) contacting a sample of 
tissue from the patient with a nucleic acid probe under stringent conditions that allow die 
formation of a hybrid complex between a nucleic acid molecule whose sequence is redted in 
any one of SEQ ID Nos.: 2, 4. 6. 8. 10, 12. 14. 18, 20. 22. 24, 26, 28, 30, 32, 34 and 36. and the 
5 probe; b) contacting a control sample with said probe under die same conditions used in step 
a); and c) detecting die presence of hybrid complexes in said samples; wherein detection of 
levels of die hybrid complex in tiie patient sample tiiat differ from levels of die hybrid complex 
in die control sample is indicative of die hypoxia-related condition. 
A still further example of a suitable metiiod may comprise die steps of: a) contacting a sample 
10 of nucleic acid from tissue of die patient widi a nucleic acid primer under stringent conditions 
fliat allow die formation of a hybrid complex between a nucleic add molecule whose sequence 
is recited in any one of SEQ ID Nos.: 2, 4, 6, 8. 10. 12, 14. 18. 20. 22, 24, 26, 28, 30. 32, 34 
and 36. and die primer; b) contacting a control sample witii said primer under die same 
conditions used in step a); c) ampUfying die sampled nucleic acid; and d) detecting die level of 
15 amplified nucleic add from bodi patient and control samples; wherein detection of levels of 
die ampUfied nuddc add m die patient sample tiiat differ significandy from levels of die 
amplified nuddc add in die control sample is indicative of die hypoxia-related condition. 
A still further example of a suitable metiiod may comprised die steps of: a) obtaining a tissue 
sample from a patient being tested for tiie hypoxia-related condition; b) isolating a nucleic add 
20 molecule according to any one of die above-described aspects of die invention from said tissue 
sample; and c) diagnosing die patient for die hypoxia-related conation by detecting die 
presence of a mutation which is assodated witii die hypoxia-related condition in die nuddc 
add molecule as an indication of die hypoxia-related condition. This metiiod may comprise die 
additional step of amplifying the nucleic add molecule to form an amplified product- and 
25 detecting die presence or absence of a mutation in die amplified product. 

Particular hypoxia-related conditions diat may be diagnosed in diis fashion include cancer, 
ischaemia, reperfusion, retinopatiiy, neonatal stress, preedapmsia, adierosclerosis, rheumatoid 
arthritis, cardiac arrest or stioke, for example, caused by a disorder of die cerebral, coronary or 
periph^ circulation. 

30 In a ftirther aspect, die invention provides a mediod for die identification of a compound diat is 
effective in die treatment and/or diagnosis of a hypoxia-regulated condition, comprising 
contacting a polypeptide, nucleic add molecule, or ligand according to any one of die above- 
described aspects of die invention widi one or more compounds suspected of possessing 



PCT/GB02/00817 

WO 02/068466 

13 

binding affinity for said polypeptide, nucleic acid molecule or Ugand, and selecting a 
compound that binds specifically to said nucleic acid molecule, polypeptide or ligand. 
According to a still further aspect of the invention, there is provided a kit useful for diagnosing 
a hypoxia-regulated condition, comprising a first container containing a nucleic acid probe that 
5 hybridises under stringent conditions with a nucleic acid molecule according to any one of the 
aspects of the invention described above; a second container containing primers useful for 
amplifying said nucleic acid molecule; and instructions for using the probe and primers for 
facilitating the diagnosis of the hypoxia-regulated condition. The kit may additionally 
comprise a third container holding an agent for digesting unhybridised RNA. 
10 To faciUtate in the diagnosis of the hypoxia-regulated condition using one of the methods 
outUned above, in a further aspect, the invention provides an array of at least two nucleic acid 
molecules, wherein each of said nucleic acid molecules either corresponds to the sequence of. 
is complementary to the sequence of, or hybridises specifically to a nucleic acid molecule 
according to any one of the aspects of the invention described above. Such an array may 
15 contain nucleic acid molecules that either correspond to the sequence of, are complementary to 
the sequence of, or hybridise specifically to at least 1-18 or more of die nucleic acid molecules 
implicated in a hypoxia-regulated condition as recited above. The nucleic acid molecules on 
the array may consist of oligonucleotides of between twelve and fifty nucleotides, more 
preferably, between forty and fifty nucleotides. Alternatively, the nucleic acid molecules on tiie 
20 array may consist of PCR-amplified cDNA inserts where the nucleic acid molecule is between 
300-2000 nucleotides. 

In a related aspect, again usefiil for diagnosis, the invention provides an array of antibodies, 
comprising at least two different antibody species, wherein each antibody species is 
immunospecific with a polypeptide impUcated in a hypoxia-regulated condition as described 

25 above. The invention also provides an array of polypeptides, comprising at least two 
polypeptide species as recited above, wherein each polypeptide species is impUcated in a 
hypoxia-regulated condition, or is a fimctional equivalent variant or firagment tiiereof. 
Kits useftil in the diagnostic methods of tiie invention may comprise such nucleic acid, 
antibody and/or polypeptide arrays. 

30 According to tiie invention, a kit may also comprise one or more antibodies tiiat bind to a 
polypeptide as recited above, and a reagent usefid for tiie detection of a binding reaction 
between said antibody and said polypeptide. 
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AcooAg » . «iU fWher .pec. o, invendo,, >h«e is p^vided a ge„^call,-n>od^^ 
.o„-h J- a.- b^n ^sfonncd » exp«ss U^. ^ ' 

said gsuMicaUy-modifitd animal is a transgenic or taoAout animal. 
5 Tbe invent also provides a m=d»d for sceedng for a conrpound rffcOive » «e« a 
hvpoxia-regnl^^ condidon, by conUcUng a ,»n.h«na., g«,«ically-»o«.d an^d as 
abov. wia a can«a« con,ponnd and deling d» ef«« of componnd on 
the physiological state of the animal. 

The of present invention will en^lo,. >n.l«s othervdse indicated, conventional 

,0 .ecb,.i,ues of .>K.lecnl.r biology, nucroWoiog,, recombir^nt DNA technology >M 
immunology, wbich are v«flnn the skill of diose wotWng to die art. 

Mos. g^ier.1 molecular Wogy, microbK^ogy tecombinant DNA teclmolo^ ^ 
illogical ^hni,^ can be fcund m Samb^olt « ^. Molecular Qonmg A 
Manual (1989) Cold Harbor-Ubo,«o,y Press. Cold Sprtog Harbor, N.Y. or Ausubel « oL. 
15 Current protocols in molecular biology (1990) lobn Wiley mi Sons, N.Y. 

Urtessdei,.edod««ise.dl.eclmicala„dsdendflc.e,msusedlie«inhave,hesa™ 
a3comn»idyunderst„odbyo.eoforaina„skmi„U»ar..„wUchdiisinv.ntioobelongs. 

A. Polypeptides. 

Polypeptides 

20 The tenn >lyp.pdde" as used herein, ref^ -o a d-i (may be branched or unbnmched) of 
™o or more amino acids littei^eachod,. by n^ansof.pepdde bond or modified pepnde 

bond (isosteres). The term polypeptide encompasses but is no. 1imit«. to oUg^des, 
pepddesa.dproteins.Thepolypeptide of tbeinvento may addidonally be either inaman^ 

^.ein fon. or to a pra-, pro- cr prep..p.«ei„ to -bat re<^ sub«pe« deava^ f^ 
25 fonnadon of .he acdve ma»re protein. pre-, prr., prepro- par. of the pmtein is often a 
leader or secraory se,««i« bu, may also b« «i addittonal sequence added u> aid protein 

pudfication (for example, a His tag) or to confonn a higher stabili^, to die ptotem. 

A polypeptide accordtag to d» invendon may also include modified ammo acids, dia. is^ 

aMtoo acids odier dian diose 20 dia, are gei^^ ™s •-"'^ '^^ •>« « 
30 naruralprocessessuchaspost-^aiisladonalprocesstagorbychemiealmcKiificatio. 

of modificadons mclude a^tyUdon, acyWon. amidation, ADP^bosyMoa. arginylabon. 
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o, a Upia dertvasv. or pbosphaddv.inosi.ol r^.y^ a«acta^, 
. or baeme ™e» « n.c.c«ide . ». cyc,«oa deme%laUO^ 

.isulphi* tad fomadc fo^aon of «.vale.. c«s.Ud=^ fbrmylation, gl,co.yla«cn. GPI 
^ fonnaaon, hyd^xyl^on, iodina^on, Upid amch».t n«d.,.ado„, n,yns,oyU^on, 
5 oxidadoB, po^olydc p«essiBg, pho^lado., n, «»on, 

^phaSoB. aBd *quitotioB. ModifioadoB of fto pol«.=pddc caB occur aBywhae «.ttuB 
^lecdc b^lodiBg bactbOBC, d,e aBiBO acid sidc^BS or a. d« N- or C-«n,Bala. 

A polyp^pdde accordiBg u. d>e iBvendo. eitf.=r be isoIa»d tom Banral (f- 
ex^Bplc purified from cel. cul^re), or i» . r«OB*iBa„d, produc«l polypcpUde, or a 
10 ^ihericallyproducedpolypcpddeoiacombiBMioBOfalldKaboyc. 

Antibodies 

A polypeptide accordiBg .0 d.e iaveBdoB, i« tacdoBal =^vale»« m>ic any iBBBunogenic 
frZ-^ derive *e polypeptide B.y be B»d » geoera^ Uganda rnc^Bg 
-BOBOdona. or pCycIoual andbodie. or a.«-body ftagB«BU. Tbeae audbod^es 
,5 ca. to be uaed «. i.ola» or ideBdfy cloaca expressiBg d« polypeptide of d» iBveBtion or » 
purify *e polypeptide b, affimty c.^B»»gr.phy. Brrte uses of *ea. iB»BUB0apec,6c 
Lboaes ™., iBcMe. but are bo. liBB«d diagBoatic, d«rap«.tio or geBeral a«y 
appUc^ioBa. Exan-plea of aasay .«hBi,»ea *a. e»p.<^ a«ibodiea are imBruBO^ys, 
rIoirBB,.Boaasaya(RIA)or.BzyB.eUBkedin™»»aort««aasay(BUSA).Ind,eaec.sea.*e 

20 aBdbodiea nuy be labelled vrid. aB aBalytically^«Uble reageBt iBcludiBg radio,ao.ot«s, a 
BuoresceBt rBolecule or any reporter BKjecule. 

The term •■in^uBO^fic" as used hereia ref«a .0 aaibodiea « have a subaraBdally higher 
aiBBity for a poln«pdde of .Ms invctioB compared witt. od« polypeptide.. The tenB 
...Bdbody aa „«d h«iB rfera B a moleode .ha. is produced by aBimala Ib reapoBae U, a. 
25 aBtigen aBd haa ti. particuhr proper., of iB^raetiug apedficany «iti. d,e ^dgeuie 
de«nBi.aB. to. iBduced formation ftap»n« °f aforeB,e»tio»=d molecde sud. aa 
Fab m-h aud acFv, »hich .B= capable of WadiBg a« aBtigeu deB.BU„aBt a.^ alao mduded 
iB me ttHB "aBttbody". Antibodies B«y also be B»diiied «. make chimeric anObod.es, whe» 
«,„.humaB variable regions rejoined or fused u, human conaum. regioBs (fo, example, e. 
30 4 PNAS.USA,84.3439 0«i7)).Pa.ticularly,.Btibodiesma,bemodified«>makeftemless 

ta^^nogeBlo .0 aB iBdividual in a process sud, as hmnaBisatioB (see, for ^^^^^ 

N«»r., 321, 522 (1986); V«hoeyen e, cL. S<M^ 239. 1534 (1988); Kaba. e. «t, . 

taBnmol.. 147. 1709 (1991); Queen « 4, FNAS. USA, 86. 10029 (1989); QonnaB « at. 
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5 !n «^v.e.. a^»o acids of a ta™. an.^,. * — «d M.^, 

M„g do«>ains. each domata betas dim»d against a diftrent epitope. 

Speciac pdydonal a^ibodies may I. ~de b, tenut^^hallenging an «™al wid. a 
.„ r^tt^ f.bistave^o„.Co™.o„a.^s.sea,o,.bept^c.<»ofan^»^^ 
reruse,*cUcke..r,bbi.,goat.ndbo,se.Tbepo.,^.-deused,oi™™«o.^^^^ 
!^ . y t« deHved by tecon*i»a« DNA t^nology or .ay be c>,e».cany.,n.^aed. I„ 
::o.:po,,pepdd=..,bec<..sa«d.oaca..p.t.„.C— .y»^^^ 

polypepddes may be conjuga^d tad.de, b« a« no. Itauted ,0 BSA (bo™ 
. a,b.-.,,Uob.ta.„d.ybo>eampe.bae»«y.ta..Sen..*»^ 

^ H collected a»d »ated acco*, to la«,wn procedcres, foe example, by 
immunoaffinity chromatography. 

Soedficmonodonalandbodlescang^^rallyben^bymahodsknownmo^skUledtate 

r:::ex.^le.K0blet,0.a.dMns.ta,C.,Na.«m4..,7a^^^^ 

20 — gy Today 4: 72 («83,-, Cole « ^, 7,-96 ta M— ~ ^ 

Tle^py Alan IL Uss. Inc. (1985) and Roitt, I « ^. —ogy. 25.10. Mo*y-Y.ar Book 
B„Jum»ed(l,93,).P.ne.ofmonoclonala«ibodlesp,odncedagataa.,bepo,,^^^^ 

^ inven^on can be screened vaHo. prope«les, i.e., fo. isotyf., ^^^^ 
against »bicb tbey a. di.ec.ed. A—,. ge«s encoding .be «.onodo^ anbbod^^ 
25 i.L.maybeisoU^df^mbybrtdom.s.foctasUncensingPCR^cbni^h.--'^* 
and cloned and expressed in appropriate vectors. 

P,agedisplay«cb„o,ogymaybe»aised»select««ge„esencodtagd.ean«bo^^ve 
e«i«da„ta™unos^c«sp^»*epo.,pep«desofU.tav^.on(s.^^ 
« ^. (1990). N.t« 348, 552-554; Ma*s. I. e, ni.. (1992) Bio«h»logy 10, 779-783). 

30 Ligands 

Thepdypep«desofa»tave«ionm.,.lso^nsed«.sea«*forta««c«ngligands»*d.o^^ 
doing ftis inctade d,e « of a lib^y of compounds (see CoUgan « cL, Cnne« 
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P^tocols iB ta-olog, 1(2); aap»r 5 (1991), isolaSng *e Uganda fton. cdU iaobdng ^ 
a cel,-.^ preparad»» o, .a»ra. ^c. <0 <^ V^ 

4 aodva« (.gonis.) or inhibit (anUg^is.) i« acd^.,. «vcl,, compounds .««^ 
o, .h. pclywade p,«en. in ^ * including aff=«ing gene «press.on a„d/cr 

5 mRNA stability. 

Uganda .0 polypepUde fom, a W.. asp^ "f ^ - " " ^ ^ 
above. P^fcned "anugonis." Ugand. include d,<»e *a. bind ,0 to polypepnde of flns 
i„ve„,<„ and »ongly inhibit an, aetivit, of to poly^pdde. Preferted -.goMst Ugand. 
taclude tose d«. bind to to po.«ep«de and ind«. activit, of to polype^^ 

,0 tW. invendo. o, inc«aaes snbstantiaHy to level o, to «de to cell. As defln»^ 
above to tetm "agonisC is n«ant to inO.de any polypepdde, pepdde, syntone nolecnle « 
o^ani'c n-oleoule that ..notions as an acdva.^. by increasing to eSccdve 
„, a polypepdde, fb, ex^le. by inceasing gene expression or enzynatic acbv.^. The tern, 
-antagonist is n«ant include ^ polyp^de, peptide, ,ynd,e«c molecule or organ,c 

15 nK,lec„.e tot tactions as an inhibitor, ^ d^reasing to eftcttve biological acbvrty o, to 
g«„ product, tor exarr^le, b, inhibidng gene expression of an en^ or a pham«co,og,cal 
receptor. 

Uganda to a polypeptide according to to invendon may conte in various tans including 
^ or ^ substrates. enzym«, -.11 or^c n»Iec»les such as small 

^t™, or syntodc organic molecules of up to preferably 800D. or less, 

pepdtonimedcs, inorganic molecules, peptides, polypepddes, anybodies, stn.c«ral or 
functional mimedcs of the aforementioned 
B. Nucleic add molecuies 

Prefened nucleic add motales of to inv«io. are toae «bich encode to pol^de 
25 se,ue„ces.ecitedinan,oneofSEQn>Nos.3.5,7,l.,13,19,2.,23or27^orwhid.e.»* 
polypepades^^odedbyanucleic add aequenceteci^di. any one ofSEQIDNos: 2,4,6.8. 

10 12. W, 18. 20, 22. 24, 26, 28, 30. 32, 34 or 36, or encoded by a gene id«>tified from » 
EST redted in any one of toseSEQn)Noa.E«mplea of such nucteic add molecules .ndmfe 

toa. listed in SEQ fD Nos. 2. 4, 6. 8, 10. 12. 14, .8, 20. 22. 24. 26, 28, 3a 3i 34 and 36 
30 homologous nuddc adds and nudeic adds tot are completontary ,0 tose mrdetc ac,d 
molecules. Nuddc add molecules of this aspect of to invendon may be used in nutor«s 
^ and applicadons. a, descdbed generdly hetdn. A m«ldc add molecule prefcr*ly 
comprises of a. least » consecufive nudeoddea faun an, one of to sequences disdoa«l m 
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. « ,n 19 U 18 20 22 24. 26. 28. 30. 32. 34 and 36, where n is 10 or 

probing purposes). 

, nl^^ linked to a peptide backbone of ainino acid residues, preferably 
least five nucleotides in length bnked to a pep 

• , ■ Th. tenranallYsine confers solubility to the composition. PNAs may 
ending m lysine. The terminal lysine deferentially bind complementary 

pegylated to extend their lifespan in a ceU. where they preferenual y P 
,5 Ze-stranded DNA and I^A and stop transcript elongation (Nielsen. P.. et al. (1993) 
Anticancer Drug Des. 8:53-63) . 

,5 1 1.. o., v^.. -.0 acid *a. code =a» 

rm««c pdypcp^dc) as m cxplidtt, idcndflcd, *a. code fc. a 6ag,»=n. of dK 
':,riL^.^af«o»le.pivaIcn.ofd»po>,pcpUdeord,a.c.-cb,a.^^. 

: rlc ona. cuiva... o, d. po.,p.pddc. Also W«d«. d^s as,»c. of d» ™vc^ 
Itl. n„..c a.d „»,.c..s ^^^^^ -coddc s..sd»o. «^a. 
30 «na„6=«a« or taerto^ or comWnadons of d>» *r««,«o«d. So* mol««l« 

1 as *ose crea^d by ch^ical »u»8=-s. or varia«s iso.a»i ftom a s,«ccs ceU « 
^.L^P. dun d» ^ w..* d» scuen^ e^BCd, ide»dfled o„.,»«d. 
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Varia„...d*.cid»olecules™yda«ft™.h=nucl«addmolecaleexpUafly«ci»di„a 

Nucleic acid ™lcc^es may ^so W»de addition] nucleic acid ^ » « expliciU, 
^W, for cxa^le. a. a>e 5' o, 3' end of .he n»lec„le. Such addidonal nucleic acids May 
5 encode for a pdypepUde »i.h added toctton.H„ compared wid, ^ oHginl polypep^d. 
WK0« ae^« is explicidy identified ^ An example of to would be an addraon of . 
seouenc. U». is hew>logous u> te ohginal nucleic acid s»,uence. fo encode a fusion prorern. 
Suchafudonpro«inn>ay be of use inaidngpurifrcationprocedures or enabling technique.^ 

be carried ou. »he.e fusion pro«ins are .e,uir«l as in yeas, two hybrid sy«e.n). 
10 AddiSoml sequences ma, also include le»ier or secrecy sequences such as rhose codmg for 
p,^,pre- or prepro- polypeptide seq«nc«. These additional sequences may also include non- 
coding ^quences that are «mscribed but not translated includiuB rihosome binding srtes and 

tennination signals. 

Anucldc acid molecule of the invention may include molecules that are at leas. 70% identical 
15 over tireir entire lengdr to a nucleic add molecule as explicitiy identified her«n in SEQ ID 
Nos.: 2, 4, 6, 8, 10, 12, 14, 18, 20, 22, 24, 26, 28, 30, 32, 34 or 36. P«ferably, a nuderc and 
molecule according to titis aspect of tire invention comprises a region drat is at least 80% 
identical ov« its entire lengti. to a nucleic acid molecule as expUcitiy identifi«i herdn in these 
SEQ ID Nos., preferably « least 90%, more preferably a, least 95% and most preferably at 
20 least 98% or 99% identical. Further pr*rred embodiment include nucleic acid molecules d>at 
encode polypeptides that retain substantially the same Mogicd function or «»ivi.y as the 
polypeptide explicitly identified herein. 

The nucleic acid molecules of the tovention can dso be «>ginee.ed usirrg medrods genendly 
known in the art These methods include but are not limited to DNA shufllmg; random or non- 
25 random fctgmentation (by restriction enzymes or shearing methods) ^ reassenMy of 
insertions, deletions, sobstitirfions andre^rangements of sequeocesby site^rected 
mutagenesis (for example, by PCR). These altenrtions may be for a number of reasons 
including for ease of domng (such as imroduction of new restriction sites), d.enr« of 
glycosylaSon pa.«:ms, changing of codon preferences, spUce variants changing tiK processing, 
30 andyor expression of the gene product (the polypeptide) In general or creating fusion ptotems 
(see above). 
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Hybridisation 

5 ^ coding n..Hc .id — - "^"^^^^ „f .U. 

a991);O-C<»u.or,I.Nem><tea56,560(1991).Ue« . 

^<A/ioRRvnervaneffl/.. Science 25 1,1 JWi.ii's'i;- 
10. Cooneycfai., Science 241, 456 (1988), Dervaner 

15 placedtogetherunderconditionsconducivetobindmg. 

. u • tAOor in a solution comprising 50% fonnamide,5XSSC(150mMm 
ovenughtmcubaUonat42 CinasoluUon p . 

w J- ^♦rot*.^ -^OmM sodium phosphate (pH7.0), 3x ueiuidiuu 
ISmM tnsodium citrate), 5UmM soai f f followed by 

d.^ .iph... and 20 ^crogran^p. de^ ^^'^ ^ 

^ wa^nsd«m««inO.«SSCa.a^xin»».y«^.l^"-»S«.=^-~ 
hybridisatton reaflioli bang earned ml at 33 i. ks«c o 
condidom med for hybridization are d.o« of Ugh »UW- 

some ^ and CaCing ,.c»rs that ™, afl^ *e hilling of ^ ""^^ 

lde^^.*aaeco,np.ido.<CK:p.i«^.ve.„-'^^-^^^ 
30 re,nire,noreenerg,»^*a.ATpair.)a.dd»chenUcaIe.v,ron.^^ 

lovalen, cadons C«.ch aa Na*) atabiUaea *.plex fcm^tion ^ « 
lhaa,or..».-deandn.eade-i«ae.had.,e,b,dia.npti.nof*eh^-^^ 
ccnponnd. a«ch aa pol,«b,le«a glycol (TCG) can i»:.«se ,««oca..o. apeeda by » 
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reagent c BLOTIO « che^cl ag^n. of«. added » block nonspecific anactoen. ^ 
^lpba..od„»«dsnppo.^..S*e«»p.ra»„w„U>soi^*es„^^^ 

J^on, as «iU inc^ .be « o, *c wasbing cond.«ons M^n 
5 hybridisation (Sambrookef a/. [SMP«»])- 

Nn^ron. «bni,nea ^ for eftCng b,bridis.«on o, nuCeic acid „ol«.«. Sncb 
^ naually involve on. of -he nndeic acid po^..«ons being labeUed. Ubemng 
methods b„. a. no. Wted » radiolabelUng, fl-o«sce^ la^bn^ 

efce-n^escen. c cb.>n,oge.ic la«ng o, d^nncaliy conpUng '^'^'^ 
,0 „„,.cu..»an.cl.oddep.cn«oraucb.a»»bioti.-».p.avidin.,a«n.Tbascanbc^^^^^^ 

obgolat^Uing. nick-nan^on. «,d-laMling o, PCR amplificadon n.ng a labelled 
Jy^U^. Ubemns of RNA .noWes can ^ .bieved b, cloning d» se,n^ 

niding d,e polypep^de of *e invenSon i«o a v^r apedflcall, fo. .b. 
vec.o.a«lcnowninU«a«andmaybe.sedu>ay«beai«RNAp«l-.nv«»by,beadd.«on 

,5 ofanappropda«RNApol,«er.ae™cbasT7,T3cSMandlabellednncleondes. 

Various IdU are connne«daBy available *a. liow ^ labeling of n»l.c^es. E«jn*les 
inclnde .hose made by Pbarmacia Upjohn (Kalama^o. MD; Pron»g. (Madison M); »d 
fte U.S. Bic^bonncal Corp. (Qevdand. OH). HyWdisadon aasays inclnde. bnt a« n« ta»d 
„ do.-b.oB. So«bem blotfng, Nortben, blcoing. cbromosome in h,bnd,sabon (for 
20 example, FISH [fluorescence in sU. hyWdisabon]), dssne in .in. bybridisadon colon, 

plaque lifts, gridded done hybridisadon assays, DNA nncroarrays and Cgonucleob^ 
Lcniarray. Tbese bybridisadon meftods ^ od-ers, ™y be used b, . skilled arUsan » 

iscla* copies of genomic DNA. cDNA, or RNA encoding bomologous or o,*o.«goos proB,ns 

from Other Species. 

25 1^ invention Aerefore also e.*odies a ,«cess for de«dng a nucleic acid mole«.le 
.ccoding «. to inve«ion. compriing «e s»ps ot (a) con^cdng a nucleic probe ««h a 
Molopcal sample .»Jer hybHdi^ng conditions » Ibnn doplexes: and (b) detecting »y su* 
duplexes tira, are fbnned.Tte term >>te" as used herdn refer, toanucleic-ddmolec* 

a bybridi^tion reaction v,hose molecular identic is know, a,^ is designed s^fically » 
30 identify nucleic adds encoding homolog^s genes in otirer species. Usudly, d« 

population is d» labelM population, bu. dtis is no. always d» case, as fb, example, m a 

reverse hybridisation assay. 
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one exa^le of a « of a pr<*e is ,0 find nucleic acid — ,«* an e^val™. «m«ion 
.„«„,« ta. a» expUcid, idendficd h«ei«. or ^ idendf, add^onal fannl, member 
same or Ota species. This be done b, probing Ubraries, s.ch as genome or cDNA 
Ubrades, derived from a s<^e of tattcest, sucl, as a human, a »>n-human anhnal. o|her 
5 eukaryo^ species, a plan, a prokaryodc s^ies or a virus. The probe may be na»,al or 
anifidlly deigned using meAodsreco^sedi. tear, (for example. Ausubel««i. 
A nucleic acid probe ™ll preferably possess g.ea« fl»n 15, more preferaN, grea^r to. 30 
and mos. preferably greater d«n 50 condguous bases complemen«ry » a ^-oleic ac,d 
molecule explicifly identified herein. 
,0 h ™ny cases. isola»d DNA from cDNA Ubraries will be mcomple» in *e region enco*ng 
d>e polypeptide. »™ally « d« 5' end. Med»ds available ibr ».bse<pendy oWaining Ml- 
lengd. CDNA s«pence include RACE (rapid ampUScadon of cDNA ends) as descrrb^y 
Frohman « -I, (Proc. Nad. Acad. Sci. USA 85. 8998-9002 (1988)), «.d n=suicdon-s.« KR. 
Which uses univ^sal primers ,0 retrieve unknown nucleie acid se,»e«. a,Sac=« to a know^ 
,5 locus (Sarkar, O. (1993) PCR Me«,ods Applic, 2:3.^322). "Inverse PCR" may also be use^i 
to amplify or to «ttend seq»e»=es using div«gent primers based on a town region (TngUa. T. 
« oL (1988) Nudeic Adds Res. 16:8186). AnoU« method which may be used is "capmre 
PCR- which involves PCR amplificadon of DNA SragmenU adjacent to a known sequence m 
human and yea« artificial chromosome DNA (Ugers^m. M. « (1991) PGR Medrods 
20 Applic ,.nl-119).Ano4»rmed,odwhichmaybeusedtoretrieve»»knowns«r«nces«d«t 
otPaA^. IB. « A (my. Nud^c Acids Res. 19:3055-3060). Additionally, one may u«= 
PCR. nested primers, and libraries, such as d. ftomoterfinder™ library (Clontech. Palo Al« 
CA) to walk genomic DNA. TOs latter process avoids d» need to screen Ubraries and is useftJ 
in finding intron/exon junctions. 
25 When screening for fitl-lengd, cDNAs, it is preferable to use libr»ies dta. have been s»- 
selected », incl»ie larger cDNAs. Also, ra»iom-primed libr^ies are prefemble, in they 
. will contain more sequences dta. contain d» S' tegions of g.n«. Use of a randomly pnmed 
libnuy may be especidly preferable for sibradons in which an oUgo d(r) library does no. yeld 
a fuB-length cDNA. (3enomic libnmes may be usefnl for extension of sequence into 5' non- 
30 transcribed regulatory re^ons. 

In one embodiment of theinvention.anucldcacidmoleculeaccc^ding to the m^^^^^ 

nsed for chromosome localisation. In this technique, a nucleic acid molecule is specifically 

targeted to. and can hybridise with, a particular location on an individual human chromosome. 
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The mapping of relevant sequences to chromosomes is an important step in the confirmatory 
correlation of those sequences with the gene-associated disease. Once a sequence has been 
mapped to a precise chromosomal location, the physical position of the sequence on the 
chromosome can be correlated with genetic map data. Such data are found in, for example, 

5 McKusick, Mendelian Inheritance in Man (available on-line through Johns Hopkins University 
Welch Medical Library). The relationships between genes and diseases that have been mapped 
to the same chromosomal region are then identified through linkage analysis (coinheritance of 
physically adjacent genes). This provides valuable information to investigators searching for 
disease genes using positional cloning or other gpne discovery techniques. Once the disease or 

10 syndrome has been crudely localised by genetic Unkage to a particular genomic region, any 
sequences mapping to that area may represent associated or regulatory genes for further 
investigation. The nucleic acid molecule may also be used to detect differences in the 
chromosomal location due to translocation, inversion, etc. among normal, carrier, or affected 
individuals. 

15 Nucleic acid molecules of the present invention are also valuable for tissue localisation. Such 
techniques facilitate the determination of expression patterns of the polypeptide in tissues by 
detection of the mRNAs that encode them. These techniques include in sUu hybridisation 
techniques and nucleotide amplification techniques, such as PGR. Results from these studies 
provide an indication of the normal fimctions of the polypeptide in the organism, as well as 

20 highlighting the involvement of a particular gene in a disease state or abnormal physiological 
condition. 

In addition, comparative studies of the normal expression pattern of mRNAs with that of 
mRNAs encoded by a mutant gene provide valuable insights into the role of mutant 
polypeptides in disease. Such inappropriate expression may be of a temporal, spatial or 
25 quantitative nature. 
Vectors 

The nucleic acid molecules of the present invention may be incorporated into vectors for 
cloning (for example. pBluescript made by Stratagene) or expression purposes. Vectors 
containing a nucleic acid molecule explicitly identified herein (or a variant thereof) form 
30 another aspect of this invention. The nucleic acid molecule may be inserted into an appropriate 
vector by any variety of well known techniques such as those described in Sambrook et al 
[supra]. Generally, the encoding gene can be placed under the control of a control element such 
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as a promoter, ribosome binding site or operator, so that the DNA sequence encoding the 

desired polypeptide is transcribed into RNA in the transformed host cell. 

Vectors may be derived fiom various sources including, but not limited to bacterial plasmids. 

bacteriophage, transposons. yeast episomes. insertion elements, yeast chromosomal elements. 
5 viruses for example, baculoviruses and SV40 (simian virus), vaccinia viruses, adenoviruses. 

fowl pox viruses, pseudorabies viruses and retroviruses, or combinations thereof, such as those 

derived from plasmid and bacteriophage genetic elements, including cosmids and phagemids. 

Human, bacterial and yeast artificial chromosomes (HACs, BACs and YACs respectively) may 

also be employed to deliver larger fragments of DN A than can be contained and expressed in a 
10 plasmid. 

Examples of retroviruses include but are not limited to: murine leukaemia virus (MLV). human 
inununodeficiency virus (HIV), equine infectious anaemia virus (EIAV), mouse mammary 
tumour virus (MMTV). Rous sarcoma virus (RSV). Fujinami sarcoma virus (FuSV), Moloney 
murine leukaemia virus (Mo-MLV), FBR murine osteosarcoma virus (FBR MSV), Moloney 
15 murine sarcoma virus (Mo-MSV). Abelson murine leukaemia virus (A-MLV), Avian 
myelocytomatosis virus-29 (MC29), and Avian erythmblastosis virus (AEV). A detailed Ust of 
retroviruses may be found in Coffin et al ("Retroviruses" 1997 Cold Spring Harbour 
Laboratory Press Eds: JM Coffin, SM Hughes, HE Varmus pp 758-763). 
Lentiviruses can be divided into primate and non-primate groups. Examples of primate 
20 lentiviruses include but are not limited to: the human immunodeficiency virus (HIV), the 
causative agent of hmnan auto-immunodeficiency syndrome (AE)S), and the simian 
immunodeficiency virus (SIV). The non-primate lentiviral group includes the prototype "slow 
virus" visna/maedi vims (VMV), as well as the related caprine arthritis-encephalitis virus 
(CAEV), equine infectious anaemia virus (EIAV) and the more recently described feline 
25 immunodeficiency virus (FIV) and bovine immunodeficiency virus (BIV). 

A distinction between the Antivirus femUy and other types of retroviruses is that lentiviruses 
have the capability to infect both dividing and non-dividing cells (Lewis et al 1992 EMBO. J 
U: 3053-3058; Uwis and Emerman 1994 J. Viiol. 68: 510-516). In contrast, other 
retroviruses - such as MLV - are unable to infect non-dividing cells such as those that make up. 
30 for exanq)le, muscle, brain, lung and liver tissue. 

A vector may be configured as a spUt-intron vector. A split intron vector is described in PCT 
patent applications WO 99/15683 and WO 99/15684. 
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If the features of adenoviruses are combined with the genetic stability of 
retroviruses/lentiviruses then essentially the adenovirus can be used to transduce target cells to 
become transient retroviral producer cells that could stably infect neighbouring cells. Such 
retroviral producer cells engineered to express an antigen of the present invention can be 
5 implanted in organisms such as animals or humans for use in the treatment of angiogenesis 
and/or cancer. 

Poxvirus vectors are also suitable for use in accordance with the present invention. Pox viruses 
are engineered for recombinant gene expression and for the use as recombinant live vaccines. 
This entails die use of recombinant techniques to introduce nucleic acids encoding foreign 
10 antigens into the genome of the pox virus. If the nucleic acid is integrated at a site in the viral 
DNA which is non-essential for the Ufe cycle of the virus, it is possible for tiie newly produced 
recombinant pox virus to be infectious, that is to say to infect foreign cells and thus to express 
the integrated DNA sequence. The recombinant pox virus prepared in this way can be used as 
live vaccines for the prophylaxis and/or treatment of pathologic and infectious disease. 

15 For vaccine delivery, preferred vectors are vaccinia virus vectors such as MVA or NYVAC. 
Most preferred is the vaccinia strain modified virus ankara (MVA) or a strain derived 
tiierefrom. Alternatives to vaccinia vectors include avipox vectors such as fowlpox or 
canarypox known as ALVAC and strains derived tiierefrom which can infect and express 
recombinant proteins in human cells but are unable to replicate. 

20 Bacterial vectors may be also used, such as salmonella, listeria and mycobacteria. 

Vectors containing tiie relevant nucleotide sequence may enter tfie host cell by a variety of 
metiiods well known in the art and described in many standard laboratory manuals (such as 
Sambrook et aU [supra], Ausubel et al, [supra], Davis et d., Basic Metiiods in Molecular 
Biology (1986)). Metiiods include calcium phosphate transfection, cationic lipid-mediated 
25 transfection, DEAE-dextran mediated transfection, electroporation, microinjection, scrape 
loading, transduction, and ballistic introduction or infection. 

Host cells 

The choice of host cells is often dependent on the vector type used as a carrier for tiie nucleic 
acid molecule of the present invention. Bacteria and odier microorganisms are particularly 
30 suitable hosts for plasmids, cosmids and expression vectors generally (for example, vectors 
derived from die pBR322 plasmid), yeast are suitable hosts for yeast expression vectors, insect 
cell systems are suitable host for virus expression vectors (for example, baculovfrus) and plant 
cells are suitable hosts for vectors such as die cauliftower mosaic virus (CaMV) and tobacco 
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mosaic vims (TMV). Other expression systems include using animal cells (for example, with 
the LentiVectors™, Oxford BioMedica) as a host ceU or even using ceU-free translating 
systems. Some vectors, such as "shuttle vectors" may be maintained in a variety of host cells. 
An example of such a vector would be pEG 202 and other yeast two-hybrid vectors which can 
5 be maintained in both yeast and bacterial cells (see Ausubel et aL. [supra] and Gyuiis, J., Cell, 
75,791-803). 

Examples of suitable bacterial hosts' include Streptococci, Staphylococci. Escherichia coU. 
Streptomyces and Bacillus Mis cells. Yeast and fungal hosts im:bide Saccliaromyces 
cerevisiae and Aspergillus cells. Mammalian cell hosts include many immortalised cell lines 

10 available fiom the American Type Culture CoUection (ATCC) such as CHO (Chinese Hamster 
Ovary) cells, HeLa cells, BHK (baby hamster kidney) cells, monkey kidney ceUs, C127, 3T3, 
BHK, HEK 293, Bowes melanoma and human hepatocellular carcinoma (for example. Hep 
G2) cells. Insect host cells that are used for baculovirus expression include Drosophila S2 and 
Spodoptera Sf9 cells. Plant host ceUs include most plants from which protoplasts be isolated 

15 and cultured to give whole regenerated plants. Practically, all plants can be regenerated from 
culnired cells or tissues, mcluding but not limited to all major species of sugar cane, sugar beet, 
cotton, fruit and other trees, legumes and vegetables. 

Expression systems 

Also included in present invention are expression vectors that comprise a nucleic add 
20 molecule as described above. Expression vectors and host cells are preferably chosen to give 
long term, high yield production and stable expression of the recombinant polypeptide and its 
variants. 

Expression of a polypeptide can be effected by cloning an encoding nucleic acid molecule into 
a suitable expression vector and inserting this vector into a suitable host cell. The positioning 

25 and orientation of the nucleic acid molecule insert with respect to the regulatory sequences of 
the vector is important to ensure that the coding sequence is properly transcribed and 
translated. Alternatively, control and other regulatory sequences may be ligated onto the 
nucleic acid molecule of this invention prior to its msertion into the expression vector. In both 
cases, the sequence of the nucleic acid molecule may have to be adjusted in order to effect 

30 correct transcription and translation (for example, addition of nucleotides may be necessary to 
obtain the correct reading frame for translation of the polypeptide from its encoding nucleic 
acid molecule). 
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A nucleic acid molecule of the invention may comprise control sequences that encode signd 
peptides or leader sequences. These sequences may be useful in directing the translated 
polypeptide toavarietyoflocations within or ou,.ide the host cell, such as to the lumen of the 

endoplasmic reticulum, to the nucleus, to tte periplasmic space, or into the extracellular 
5 environment. Such signals may be endogenous to the nucleic acid molecules of the invenUon, 
or may be a heterologous sequence. lUese leader or control sequences may be removed by the 
host during post-translational processing. 

A nucleic acid molecule of the present invention may also comprise one or more regulatory 
sequences that allow for regulation of the expression of polypeptide relative to the growth of 
10 the host cell. Alternatively, these regulatory signals may be due to a heterologous sequence 
from the vector. Stimuli that these sequences respond to include those of a physical or 
chemical nature such as the presence or absence of regulatory compounds, changing 
temperamres or metabolic conditions. Regulatory sequences as described herein, are non- 
translated regions of sequence such as enhancers, promoters and the 5' and 3' untranslated 
15 regions of genes. Regulatory sequences interact with host cellular proteins that carry out 
translation and transcription. These regulatory sequences may vary in strength and speaficity. 
Examples of regulatory sequences include those of constitutive and inducible promoters. In 

bacterial systems, an example of an inducible promoter is the hybrid loci promoter of the 

Bluescript phagemid (Stratagene, LaJoUa. CA) or pSportiTM plasmid (Gibco BRL). The 
20 baculovirus polyhedrin promoter may be used in insect cells. 

An example of a preferred expression system is tiie lentivirus expression system, for example, 

as described in International patent appUcation W098/17815. 

Detection of uptake of vectors by the host organism 

Various methods are known in tiie art to detect the uptake of a nucleic acid or vector molecule 
25 by a host cell and/or the subsequent successful expression of the encoded polypeptide (see for 
example Sambrook et al., [supra]). 

Vectors fxequentiy have marker genes that can be easily assayed. Ihus. vector uptake by a host 
cell can be readily detected by testing for the relevant phenotype. Mari^rs include, but are not 
limited to those coding for antibiotic resistance, herbicide resistance or nutnUonal 
30 requirements. The gene encoding dihydrofolate reductase (DHFR) for example, confers 
resistance to methotrexate (Wigler. U, et oL (1980) PNAS 77:3567.70) and the gene npt 
confers resistance to tiie aminoglycosides neomycin and G^18 (Colbere-Garapin. F. et al 
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(1981) J. Mol. Biol. 150:1-14). Additional selectable genes have been described, examples of 
which will be clear to those of sldllin the art. 

Markers however, only indicate that a vector has been taken up by a host cell but does not 
distinguish between vectors that contain the desired nucleic acid molecule and those that do 

5 not. One method of detecting for the said nucleic add molecule is to insert the relevant 
sequence at a position that will disrupt the transcription and translation of a marker gene. 
These cells can then be identified by the absence of a marker gene phenotype. Alternatively, a 
maricer gene can be placed in tandem with a sequence encoding a polypeptide of the invention 
under the control of a single promoter. Expression of the marker gene in response to induction 

10 or selection usually indicates expression of the tandem gene as well. 

More direct and definitive methods to detect the presence of the nucleic acid molecule of the 
present invention include DNA-DNA or DNA-RNA hybridisation with a probe comprising the 
relevant antisense molecule, as described above. More direct methods to detect polypeptide 
expression include protein bioassays for example, fluorescence activated ceU sorting (FACS), 

15 immunoassay techniques such as ELBA or radioimmunoassays. 

Alternative methods for detecting or quantitating the presence of the nucleic acid molecule or 
polypeptide of this invention include membrane, solution or chip-based technologies (see 
Hampton, R. et aL, (1990) Serological Methods, a Laboratory Manual. APS Press, St Paul, 
MN) and Maddox. DB. et aL, (1983) J. Exp. Med, 158, 1211-1216). 

20 Transgenic animals 

In another embodiment of this invention, a nucleic acid molecule according to the invention 
may be used to create a transgenic animal, most commonly a rodent. The modification of the 
animal's genome may either be done locally, by modification of somatic ceUs or by germ Une 
therapy to incoiporate inheritable modifications. Such transgenic animals may be particularly 
25 useful in the generation of animal models for drug molecules effective as modulators of the 
polypeptides of the present invention. 

Polypeptide purification 

A polypeptide according to the invention may be recovered and purified ftom recombinant cell 
cultures by methods including, but not limited to ceU lysis techniques, ammonium sulphate 
30 precipitation, ethanol precipitation, acid extraction, anion or cation chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 



wo 02/068466 



PCT/GB02/OO817 



29 



Chromatography, hydroxylapatite chromatography and lectin chromatography, high 
performance liquid chromatography (HPIX:) or fast performance liquid chromatography 
(FPLC). me polypeptide may need refolding after purification or isolation and many well 
known techniques are available that wiU help regenerate an active polypeptide conformation. 
5 Many expression vector, are commercially available that aid purification of the relevant 
polypeptide. These include vectors that join the sequence encoding the polypeptide to another 
expressed sequence creating a fused protein that is easier to purify. Ways in which these fused 
parts can facilitate purification of the polypeptide of this invention include fusions that can 
increase the solubility of the polypeptide, joining of metal chelating peptides (for example. 
10 histidine-tryptophan modules) that aUow for purification with immobilised metals, joining of 

protein A domains which allow for purification with immobilised immunoglobulins and the 
joining of the domain that is utilised in the FLAGS extension/affinity purification system 

(Immunex Corp.. Seattle. WA). Fusion of the polypeptide of this present invention with a 

secretion signal polypeptide may also aid purification. This is because the medium into which 
15 the fused polypeptide has been secreted can subsequendy be used to recover and purify the 

expressed polypeptide. 

If necessary, these extraneous polypeptides often comprise a cleavable linker sequence which 
allows the polypeptide to be isolated from the fusion. Qeavable linker sequences between the 
purification domain and the polypeptide of the invention include those specific for Factor Xa 

20 or for enterokinase (Invitrogen, San Diego. CA). One such expression vector provides for 
expression of a fusion protein containing the polypeptide of the invention fused to several 
histidine residues preceding a thioredoxin or an enterokinase cleavage site. TTie histidine 
residues facilitate purification by IMAC (immobilised metal ion affinity chromatography as 
described in Porath, J. et al (1992). Prot Exp. Purif. 3: 263-281), whUe the thioredoxin or 

25 enterokinase cleavage site provides a means for purifying the polypeptide from tiie fiision 
protein. A discussion of vectors that contain fiision proteins is provided in KroU, D J. et al. 
(1993; DNA Cell Biol. 12:441-453). 
Assays 

. Another aspect of this invention includes assays fliat may be carried out using a polypeptide or 
30 nucleic acid molecule according to the invention. Such assays may be for many uses including 
the development of drug candidates, for diagnostic purposes or for tiie gatiiering of information 

for therapeutics. 
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Ip^Il*=»^of,>».K«.ceHinwhichUis«p«sed..„^s«..U,ehos,««s 
' b. barv««d prior » « in >he a»a,, foe example using «hn.^ so* as 

n„™ ac^ .a»d «n sowing (FAC) c i™.nn03«nity ^ni,.«. K po.^^* - 
5 .««=d into *en,edinn.««n»diun. can be recov^ea in crier »«co« and p«n^«« 

ex^sse. pol^de. H pol^de produced in«ce>.«y, *e ceU» n.u« fir. be I,«d 
before the polypeptide is recovered. 

The poiypepSde of inv«,ta can be u»d «> screen Ubr^es of coords in any of a 
of screening Such con,^ nray acttva« (agon,se) or *M 

,0 (anugonise) d,e level of expressi™, of d» gene or *e a«ivi«. of fl» pol,^«de of 
LJio.a„d,on.af.r.heraspec.of««presen.i.vendon.Exarnplesofs„i,abiec«^^^ 

^ toe v*icb are effecdve ,0 alBr 4» expression of a nannd gene «b.ch encodes a 
poIypepHdc of*e invention ortoregulatefteactivityotapolypepfide of fl^invennon. 

Agonis, or anugonis. con^ n»y ^ -lated *on>, for example, cell-f.ee 
15 pIpara«ons,ci«McaiBbraries.na»ralpcod„crnnx.«r=s.Th.seagoms^ 

be natural or modified s»bs»es. ligands, enzyme^ receptors or »u«u,al or ta«.o.al 
^imedcs. For a sniuble teview of such screening techd^es. see Coligan e, al„ Conent 
Pmtocols in Immunology l(2)«iapBr 5 (1991). 

Po^ntial agonists or antagonists i»lnde small orgamc molecules, pepddes, polypeptides and 
20 andbodies that bind to d» polypepdde of tta invendon ^ 
tbis fashion, billing of .he polypepdde to nonnal cellular binding mole^ ^ 
p«endatedori«hibi.ed.sucb.ha.d»norm.l«.logic.lacdvityof.hepolypep«de,sed>«K.d 

or prevented. 

The polypeptide of .he invention dta. is employed in such a sceening technique may be free in 
25 solution,affixod«.asolidsuppo*bon»onacellsurfaceorlocatedin«aceUulaHylngeneral, 

»ch sceenlug procedures ma, involve using appropriate cdls or cell membranes d». exp«s 
^ polypeptide that are contacted wid, a test compound to observe binding, or stmmUnon or 
iaMWtionofafimotionaltesponse. TV fimetional response of d^cdlsoont^ 
compound is d« compared v«*conK,l cells we,. «.tcontac»d«ititd« .est compc^L 

30 Suchanassa,mayassesswhed»rd».=s.c«np<»ndresdBin.sip»lP»«'^'^-'"*"™ 
of tire polypeptide.using an appropri.^ deletion syst™.lnhibi»rs of activadon at. generaB, 

assay«l to the pr«.nce of a knov™ agonist and d,e eftct on activation by fte agomst .0 .he 
presence of the test compound is observed. 
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AI.em.Wy, ^ binding assays may ^ n«d, in which the adhe«nce of a ^ compcnnd 
ZJJr,, ^ . d^eced .y ^eans Of a « ^y^^^ 

^ *e ^ coc^ 0, in an assay in^Hn, ^'^J^'^^ 
compea™. In anote embodin», compeddve dmg screening assays may be used, m which 
, rl-.nga„d^.s*a.a.capa.eorhind.gd»P.,^ddespec^,c»pe.»*^^ 
.es.eompo«ndforbMng.Ind.sma.ner,.hea«i^eacani«»sed»de.eC«.ep«enceo, 

an, compound to. possesses specific binding affini^- dK polypepdde. 
Msaysn.yalsobedesip«d«de.^d.effec,o,.dded««nponndsond«pr»W^^^ 
encoding d-e po,ypepdde in ceU. Per exan.,. an EUSA may be cons»c«^^-^ 
,0 measles sece^d or cen..ssocia.ed .e.eU of polypeptide using monodonal or polydon^ 
.„dhodieshys»dardn>ed,odsknownin4ea,.andd,isc^beused„se«bforco.^n2 

^.may inMbi. or enhance *e p«d.cdon of d» polypeptide ftom sniubl, nu>mpnU»d ce^ 
„ tissue. formation of binding complexes between ti« polypc-ide >^ ^ compound 
being tested may then be measured. 

r A c™,mB which may be used provides for high throughput 
15 Another technique for drug screening wMcn may qc u»c f 

screeniug of compounds having suifable Mndlng affini. . d« polypeptide .^^^^^^^ 

ta»rnatio.alpa«n,appticationW084A«564,.l.«sn»*od.largenumt«sofd,«^n.^^^^ 
« compounds are synd«sised on a solid s.bsti«e, which may *en be resell w.ft ,he 
polypeptide of d» invention and washed. way of immobiUsing d» polypeptide rs .o use 

a, uon-neutiatising antibodies. Bound poly^de may .hen be deleted using methods ^ » 
^,lla.ownind».«.I«fledpolypeptideca„alsobecoa«ddirecdyon»pla»sfb,»semti,e 

aforementioned drug screening techniques. 

A polypeptide ac^ .o d» invemion may be used .o identify membrane-bound or soluble 
^.ors. *.onsh s»nd.rd recep» bindi.^ «tai,nes ^ ^ known in such - 

25 Ugand binding and cossliddng assays in which d» polypeptide is UbeUed witi. . radioactive 
iso«,pe.ischemic.llymodified,orisf»sed».peptidese^nce«facili.a.esi.de.ection^- 

purification, ^ incub.«d wid, a source of 4c puutive recep«>r (for example, a co^os«o„ 
„, cells, cell membranes, cell supema«n., tis»e extiacs, or bodily fluids). Ibe efficacy o 
M„g may be measured udng biophysical «chm,u« such as ^ plasmon resonance and 
30 spec^oscopy. Binding assays may l« used ibr ti» purification and donm, of ti» rec^. b« 
™,d«,identif,.gonistsandanUgonis.softi«pol,pep«de,fl«.compc«wid,«>ebmdmg<rf 

polypeptide » iK receptor. Sudani mediuds ta conducting sceernng assays weH 

understood in the art. 
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or as a fusion with a peptide or poiypep ^^rfonning a competition 

- - ' "-^'":LIlC.:r*a.«c.pa-o,spec.o..y 

A^mnPtp with a test compound tor omoing. ^ 

»tib<*« n,a, be used °* 7 "Tl ^ L ^own in 

J • 1 h«t are not limited to. cross-linking assays and filter binoing y 
art and include, but are not limi . • i„,i^ding s^^^^ 

efficacy of binding may be measured usmg biophysical techmq 

20 resonance and spectroscopy. 

• • „tvn.ofassavwhichenablesalargenumberofcompoundstobe 
»tbroughputscreemngisatypeofassay.h^^ 

searched for any significant bindmg ac..ty « scenario, many 

applicationWO84/03564).TMsisparticdarlyusefiiland™gs^^^^ 

^air.c«yo„op.a»s(or»»i«0«*-->''>«^*»^»™»°^'^- 

«. H« are also indoded wiWn *c tenm of tte present tovettion are .hose that 
^, "^"^ *"r»I ld pobpeptides of *e in«n.on in « ablation 

30 involve the use of ihe genes ana poiypep ,rt„„.~sto,lypeptides in cells and 

.say.Snc.assaysinvolved«n».p«l«^onof.«e.^d««g^^^ 

«f thi- inroact of this manipulation event on the phy aoiogy oi f 
assessment of the impaci 01 uiK. t . ^»,u„nrnathwavs in which the 

^e.an^e,s.c.e«p«in»n«,e,ea.de*ofsign.,..^^^^^ 

partcular genes/p;ayw«ides are implica^d. g«««.e mfbnnanon regardmg 
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Which related genes and FOteins are regulated. 

^ ^ Of ,Hs .venuo, p.*» for a., — a. O.^ o, a=ve,oped 

from any of the aforementioned assays. 
5 C Pharmaceuticals 

^ J fuic nrpoent invention (herein known as aj is 

lx.y.asa.Mo«p«f=.aW,Xco^a.>.aa.95*,«».d»o«p,rfe»«,99%of 

the total X+Y mass. 



15 Carriers 



Carriers 

■ , 1 , „,av be senes polypeptides, antibodies, liposomes or indeed any other agent 
Carrier molecules may be genes, pui^p f „«^H,irtion of 

„„ided ta. *= cani« dc« no. Wf ta*c .oxici^ »' ^ 

ca^e. .CO. po,.a^..d. p«„>a.o a* M,^-- - 
20 inacdv. vim pa«.=l«. Cam«s ^ i»cl«de phan.a«udcali, acoep* . 

orga.0 acid, (for oxar^^lc, ac^a^e. propionates, malona., — 
accepUbU carries addiUonal., co.»in ,Mds such as 

as !L«saad*eU^.C»iers,n^=..bIe«»pbar,nac...ca.co,npo.«ons.b.— 
*lo«. pma, drape caps*s. l^d. ^. '° 
RMtagton's Pharnnu^ulical Science (MackPub. Co., NJ. 1991). 

30 ^ „™n.ofco,.ponen.X,»*ocon^*i«.sb«».d.l.obo»*er.pcuMycao«^^^ 
I a»Uora«, or (f^r oxa.,,.0. when .sod as . vacc^) a «^ 
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amount" may be by carrying out cell culture assays (for example, using neoplastic cells) or 
using animal models (for example, mice, rabbits, dogs or pigs). In addition to determining the 
appropriate concentration range for X to be therapeutically effective, animal models may also 
yield other relevant information such as preferable routes of administration that will give 
5 maximum effectiveness. Such information may be useful as a basis for patient administration. 
A "patient" as used in herein refers to the subject who is receiving treatment by administration 
of X. Prefefafc>ly; the patient is human, but the term may also include animals. 

The ther^eutically-effective dosage will generally be dependent on the patient's status at the 
time of administration. Factors diat may be taken into consideration when determining dosage 

10 include the severity of the disease state in the patient, the general health of the patient, the age, 
weight, gender, diet, time and frequency of administration, drug combinations, reaction 
sensitivities and tiie patient's tolerance or response to the tiierqjy. The precise amount can be 
determined by routine experimentation but may ultimately lie with the judgement of the 
clinician. Generally, an effective dose will be from 0.01 mg/kg (mass of drug compared to 

15 mass of patient) to 50 mg/kg, preferably 0.05 mg/kg to 10 mg/kg. Compositions may be 
administered individually to a patient or may be administered in combination with other 
agents, drugs or hormones. 
Routes of administration 

Uptake of a pharmaceutical composition of die invention by a patient may be initiated by a 
20 variety of methods including, but not limited to enteral, intira-arterial, intrathecal, 
intramedullary, intramuscular, intranasal, intraperitoneal, intravaginal, intravenous, 
intraventricular, oral, rectal (for exanq)le, in die form of suppositories), subcutaneous, 
sublingual, transcutaneous applications (for example, see WO98/20734) or transdermal means. 

Gene guns or hyposprays may also be used to administer die pharmaceutical compositions of 
25 the inventiion. Typically, the tiierapeutic compositions may be prepared as injectables, either as 
liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid 
vehicles prior to injection may also be prepared. Direct delivwy of the compositions can 
generally be accomplished by injection, subcutaneously, intraperitoneally, intravenously or 
intramuscularly, or delivered to die interstitial space of a tissue. The compositions can also be 
30 administered into a lesion. Dosage treatment may be a single dose schedule or a multiple dose 
schedule. 
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Inhibition of excessive activity 

If a particular disease state is partiaUy or completely caused by an inappropriate excess in the 
activity of a polypeptide according to the invention, several apFoaches are available for 
inhibiting this activity. 

5 One approach comprises administering to a patient an inhibitor compound (antagonist) along 
with a pharmaceutically acceptable carrier in an amount effective to inhibit the function of the 
polypeptide, such as by blocking the binding of a Ugand, substrate, enzyme, receptor, or by 
inhibiting a second signal, and thereby alleviating the abnormal condition. Such an antagonist 
molecule may, for example, be an antibody. Most preferably, such antibodies are chimeric 

10 and/or humanised to minimise their immunogenicity, as previously described. 

In another approach, soluble forms of the polypeptide that retain binding affinity for the ligand, 
substrate, enzyme, receptor, in question, may be administered to the patient to compete with 
the biological activity of the endogenous polypeptide. Typically, the polypeptide may be 
administered in the form of a fragment that retains a portion that is relevant for the desired 
15 biological activity. 

In an alternative approach, expression of the gene encoding the polypeptide can be inhibited 
using expression blocking techniques, such as by using antisense nucleic acid molecules (as 
described above), either internally generated or separately administered. Modifications of gene 
expression may be effected by designing complementary sequences or antisense molecules 

20 (DNA, RNA, or PNA) to die control, 5' or regulatory regions (signal sequence, promoters, 
enhancers and intions) of the gene encoding the polypeptide. Similariy, inhibition can be 
achieved using "triple heUx" base-pairing methodology. Triple helix pairing is useful because 
it causes inhibition of Ae ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent thorapeutic advances uang 

25 triplex DNA have been described in the literature (Gee, JJE. et al. (1994) In: Huber. BE. and 
B.I. Cair. Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, NY). 
The complementary sequence or antisense molecule may also be designed to block translation 
of mRNA by preventing the transcript from binding to ribosomes. Such oligonucleotides may 
be administered or may be generated in situ from expression in vivo. 

30 Gene alencing approaches may also be undertaken to down-regulate endogenous expression of 
a gene. RNA interference (RNAi) (Elbashir, SM et al.. Nature 2001, 411, 494-498) is one 
mediod of sequence specific post-transcriptional gene silencing that may be employed. Short 
dsRNA oligonucleotides are synthesised in vitro and introduced into a cell. The sequence 
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specific binding of these dsRNA oligonucleotides triggers the degradation of target mRNA, 
reducing or ablating target protein expression. 

In addition, expression of a polypeptide according to die invention may be prevented by using 
a ribozyme specific to the encoding mRNA sequence for the polypeptide. Ribozymes are 
5 catalytically active RNAs that can be natural or synthetic (see for example Usman, N, et aL, 
Curr. Opin. Struct. Biol (1996) 6(4), 527-33). Syntiietic ribozymes can be designed to 
specifically cleave mRNAs at selected positions thereby preventing translation of the mRNAs 
into functional polypeptide. Ribozymes may be synfliesised with a natural ribose phosphate 
backbone and natural bases, as normally found in RNA molecules. Alternatively the ribozymes 
10 may be synthesised with non-namral backbones, for example. 2'-0-methyl RNA, to provide 
protection fiom ribonuclease degradation and may contain modified bases. 
Efficacy of tiie gene silencing approaches assessed above may be assessed through the 
measurement of polypeptide expression (for example, by Western blotting), and at tiie RNA 
level using TaqMan-based metiiodologjes. 
15 RNA molecules may be modified to increase their intracellular stability and half-life. Possible 
modifications include, but are not limited to, ±t addition of flanking sequences at the 5' and/or 
3' ends of the molecule or the use of phosphorothioate or 2' O-methyl rather than 
phosphodiesterase linkages within the backbone of ttie molecule. This concept is inherent in 
the pixKiuction of PNAs and can be extended in all of tiiese molecules by the inclusion of non- 
20 traditional bases such as inosine, queosine and butosine, as well as acetyl-, metiiyl-, tiiio- and 
similarly modified forms of adenine, cytidine, guanine, thymine and uridine tiiat are not as 
easily recognised by endogenous endonucleases. 

Activation of a polypeptide activity 

If a particular disease state is partially or completely due to a lowered level of biological 
25 activity fi'om a polypeptide according to tiie invention, various metiiods may be used. An 
example of such a metiiod includes administering a tiierapeutically effective amount of 
compound tiiat can activate (i.e. an agonist) or cause increased expression of tiie polypeptide 
concerned. Administration of such a compound may be via any of the metiiods described 
previously. 
30 Gene Therapy 

Anotiier aspect of tiie present invention provides for gene tiierapy metiiods involving nucleic 
acid molecules identified herein. Gene therapy may be used to affect the endogenous 
production of tiie polypeptide of tiie present invention by relevant cells in a patient For 
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example, gene therapy can be used pennanently to treat the inappropriate production of a 
polypeptide by replacing a defective gene with the corrected therapeutic gene. 

Treatment may be effected either in vivo or ex vivo. Ex vivo gene therapy generally involves 
the isolation and purification of the patient's cells, introduction of the therapeutic gene into the 

5 cells and finally, the introduction of the genetically-altered cells back into die patient. In vivo 
gene therapy does not require the isolation and purification of patient cells prior to the 
introduction of the therapeutic gene into the patient. Instead, the therapeutic gene can be 
packaged for delivery into the host. Gene delivery vehicles for in vivo gene ther^y include, 
but are not limited to, non-viral vehicles such as liposomes, replication-deficient viruses (for 

10 example, adenovirus as described by Berkner, KJL., in Curt. Top. Microbiol. Immunol.. 158, 
39-66 (1992)) or adeno-associated virus (AAV) vectors as described by Muzyczka, N., in Curr. 
Top. Microbiol. Immunol., 158, 97-129 (1992) and U.S. Patent No. 5,252,479. Alternatively, 
"naked DNA" may be direcfly injected into the bloodstream or muscle tissue as a form of in 
vivo gene therapy. 

15 One example of a strategy for gene therapy including a nucleic acid molecule of this present 
invention may be as follows, A nucleic acid molecule encoding a polypeptide of the invention 
is engineered for expression in a replication-defective retroviral vector. This expression 
construct may then be isolated and introduced into a packaging cell transduced with a 
retroviral plasmid vector containing RNA encoding the polypeptide, such that the packaging 

20 cell now produces infectious viral particles containing the gene of interest. These producer 
cells may be administered to a patient for engineering cells in vivo and expression of the 
polypeptide in vivo (see Chapter 20, Gene Therapy and other Molecular Genetic-based 
Therapeutic Approaches, (and references cited therein) in Human Molecular Genetics (1996), 
T Strachan and A P Read, BIOS Scientific Publishers Ltd). 

25 Genetic delivery of antibodies diat bind to polypeptides according to the invention may also be 
effected, for example, as described in International patent application WO98/55607. 

Vaccines 

A further embodiment of die present invention provides that the polypeptides or nucleic acid 
molecules identified may be used in the development of vaccines. Where the aforementioned 
30 polypeptide or nucleic acid molecule is a disease-causing agent, vaccine development can 
involve the raising of antibodies against such agents. Where die aforementioned polypeptide or 
nucleic acid molecule is one that is up-regulated, vaccine development can involve the raising 
of antibodies or T cells against such agents (as described in WOOO/29428). 
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Vaccines according to the invention may either be prophylactic (i.e. prevents infection) or 
therapeutic (i.e. treats disease after infection). Such vaccines comprise immunising antigen(s), 
immunogen(s), polypeptide(s). protBin(s) or nucleic acid, usuaUy in combination with 
pharmaceutically-acceptable carriers as described above. Additionally, these carriers may 
5 function as immunostimulating agents ("adjuvants"). Furthermore, the antigen or immunogen 
may be conjugated to a bacterial toxoid, such as a toxoid from diphtheria, tetanus, cholera, H. 
pylori, and other pathogens. 

Vaccination processes may involve the use of heterologous vectors eg: prime with MVA and 
boost with DNA. 

10 Since polypeptides may be broken down in the stomach, vaccines comprising polypeptides are 
preferably administered parenterally (for instance, subcutaneous, intramuscular, intravenous, or 
intradermal injection). Formulations suitable for parenteral administration include aqueous and 
non-aqueous sterile injection solutions that may contain anti-oxidants, buffers, bacteriostats 
and solutes which render the formulation isotonic with the blood of the recipient, and aqueous 

15 and non-aqueous sterile suspensions which may include suspending agents or thickening 
agents. 

The vaccine formulations of the invention may be presented in unit-dose or muW-dose 
containers. For example, sealed ampoules and vials and may be stored in a freeze-dried 
condition requiring only the addition of the sterile liquid carrier immediately prior to use. The 
20 dosage wUl depend on the specific activity of the vaccine and can be readily determined by 
routine experimentation. 

The technology referred to as jet injection (see, for example, www.powderjecLcom) may also 
be useful in the formulation of vaccine compositions. 

In accordance with this aspect of the present invention, polypeptides can be delivered by viral 
25 or non-viral techniques. Non-viral delivery systems inctade but are not Umited to DNA 
transfection methods. Here, transfection includes a process using a non-viral vector to deUver 
a antigen gene to a target mammalian cell. Typical transfection methods include 
electroporation, nucleic add biolistics, lipid-mediated transfection, compacted nucleic acid- 
mediated transfection, liposomes, immunoliposMnes, lipofectin, cationic agent-mediated, 
30 cationic facial amphiphiles (CPAs) (Nature Biotechnology 1996 14; 556), multivalent cations 
such as spermine, cationic lipids or polylysine, 1, 2,-bis (oleoyloxy)-3-(trimethylammonio) 
propane (DOTAP)-cholesterol complexes (Wolff and Trubetskoy 1998 Natore Biotechnology 
16: 421) and combinations thoeof. 
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Viral delivery systems include but are not limited to adenovirus vectors, adeno-associated viral 
(AAV) vectors, herpes viral vectors, influenza, retroviral vectors, lentiviral vectors or 
baculoviral vectors, Venezuelan equine encephalitis virus (VEE), poxviruses such as: 
canarypox virus (Taylor et al 1995 Vaccine 13:539-549). entomopox virus (li Y et al 1998 
5 xnth International Poxvirus Symposium pl44. Abstract), penguine pox (Standard et al. J Gen 
Virol. 1998 79:1637-46) alphavirus, and alphavirus based DNA vectors. 

In addition to the use of polypeptide-based vaccines, this aspect of the invention includes the 
use of genetically-based vaccines, for example, those vaccines that are effective through 
eliciting the expression of a particular gene (either endogenous or exogenously derived) in a 
10 cell, so targeting this cell for destruction by the inmiune system of the host organism. 

A number of suitable methods for vaccination and vaccine delivery systems are described in 
International patent application WOOO/29428. 

D. Diagnostics 

Another aspect of the present invention provides for the use of a nucleic acid molecule 

15 identified herein as a diagnostic reagent 

For example, a nucleic acid molecule may be detected or isolated from a patient's tissue and 
used for diagnostic purposes. 'Tissue" as defined herein refers to blood, urine, any matter 
obtained from a tissue biopsy or any matter obtained from an autopsy. Genomic DNA from the 
tissue sample may be used directiy for detection of a hypoxia-related condition. Alternatively, 

20 the DNA may be amplified using metiiods such as polymerase chain reaction (PGR), the ligase 
chain reaction (LCR), strand displacement amplification (SDA), or other amplification 
techniques (see Saiki et al.. Nature, 324, 163-166 (1986); Bej, et al. Grit Rev. BiochenL 
Molec. BioL, 26, 301-334 (1991); Birkenmeyer et aU J. Virol. Meth., 35, 117-126 (1991) and 
Brunt, J., Bio/Technology, 8, 291-294 (1990)). Such diagnostics are particularly usefiil for 

25 prenatal and even neonatal testing. 

A method of diagnosis of disease using a polynucleotide may comprise assessing the level of 
expression of the natural gene and comparing the level of encoded polypeptide to a control 
level measured in a normal subject that does not suffer from the disease or physiological 
condition diat is being tested. The diagnosis may comprise the following steps: 

30 a) contacting a sample of tissue fix)m die patient with a nucleic add probe under stringent 
conditions that allow the formation of a hybrid complex between a nucleic acid molecule 
of the invention and the probe; 
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b) contacting a control sample with said probe under the same conditions used in step a); and 

c) detecting the presence of hybrid complexes in said samples; 

wherein detection of differing levels of the hybrid complex in the patient sample compared to 
levels of the hybrid complex in the control sample is indicative of die dysfunction. 
5 A fimher aspect of die invention comprises a diagnostic metiiod comprising tiie steps of: 

a) obtaining a tissue sample from a patient being tested for disease; 

b) isolating a nucleic acid molecule according to die invention from said tissue sample; 
and 

c) diagnosing the patient for disease by detecting tiie presence of a mutation in tbt nucleic 
10 acid molecule which is associated vyitii disease. 

To aid the detection of nucleic acid molecules in die above-described methods, an 
amplification step, such as PCR, may be included. An example of tiiis includes detection of 
deletions or insertions indicative of die dysfunction by a change in tiie size of die amplified 
product in comparison to die normal genotype. Point mutations can be identified by 

15 hybridising ampUfied DNA to labelled RNA of die invention or alternatively. labeUed 
antisense DNA sequences of die invention. Perfecfly matched sequences can be distinguished 
from mismatched duplexes by RNase digestion or by assessing differences in melting 
temperatures. The presence or absence of the mutation in the patient may be detected by 
contacting DNA witii a nucleic acid probe tiiat hybridises to the DNA under sttingent 

20 conditions to form a hybrid double-stranded molecule, die hybrid double-stranded molecule 
having an unhybridised portion of die nucleic acid probe strand at any portion corresponding to 
a mutation associated widi disease; and detecting die presence or absence of an unhybridised 
portion of die probe strand as an indication of die presence or absence of a disease-associated 
mutation in die corresponding portion of die DNA strand. 

25 Point mutations and odier sequence differences between die reference gene and "mutant" genes 
can be identified by otiier well-known techniques, such as direct DNA sequencing or single- 
strand conformational polymorphism, (see Orita et aL, Genomics. 5. 874-879 (1989)). For 
example, a sequencing primer may be used widi double-stranded PCR product or a single- 
stranded template molecule generated by a modified PCR. The sequence determination is 
30 performed by conventional procedures widi radiolabelled nucleotides or by automatic 
sequencing procedures widi fluorescent-tags. Qoned DNA segments may also be used as 
probes to detect specific DNA segments. The sensitivity of tiiis metiiod is greafly enhanced 
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when combined with PCR. Further, point mutations and other sequence variations, such as 
polymorphisms, can be detected as described above, for example, through the use of allele- 
specific oligonucleotides for PCR amplification of sequences that differ by single nucleotides. 
DNA sequence differences may also be detected by alterations in the electrophoretic mobUity 

5 of DNA fragments in gels, with or without denaturing agents, or by direct DNA sequencing 
(for example. Myers et aL, Science (1985) 230:1242). Sequence changes at specific locations 
may also be revealed by nuclease protection assays, such as RNase and SI protection or the 
chemical cleavage method (see Cotton et aL, PNAS. USA (1985) 85: 4397-4401). 
In addition to conventional gel electrophoresis and DNA sequencing, mutations such as 

10 microdeletions, aneuploidies, translocations, mversions. can also be delected by in situ analysis 
(see, for example, Keller et ai, DNA Probes. 2nd Ed., Stockton Press. New York. N.Y.. USA 
(1993)). that is. DNA or RNA sequences in cells can be analysed for mutations without need 
for their isolation and/or immobilisation onto a membrane. FISH is presently the most 
commonly applied method and numerous reviews of FISH have appeared (see. for example. 

15 Trachuck et al. Science. 250. 559-562 (1990), and Trask et aL, Trends. Genet., 7, 149-154 
(1991)). 
Arrays 

In another embodimem of the invention, an array of oligonucleotide probes comprising a 
nucleic acid molecule according to the invention can be constructed to conduct efficiem 
20 screening of genetic variants, mutations and polymorphisms. Array technology methods are 
well known and have general appUcability and can be used to address a variety of questions in 
molecular genetics including gene expression, genetic linkage, and genetic variability (see for 
example: M.Chee et al.. Science (1996), Vol 274, pp 610-613). 

In one embodiment, the array is prepared and used according to die methods described in 
25 W095/11995 (Ghee et d); Lockhart, D. J. et aL (1996) Nat. Biotech. 14: 1675-1680); and 
Schena. M. et al. (1996) PNAS 93: 10614-10619). OUgonucleotide pairs may range from two 
to over one million. The oligomers are syntiiesized at designated areas on a substrate using a 
light-directed chemical process. The substrate may be paper, nylon or otiier type of membrane, 
filter, chip, glass slide or any otiier suitable soUd support In another aspect, an oligonucleotide 
30 may be synthesized on the surface of the substrate by usmg a chemical coupling procedure and 
an ink jet application apparams, as described in PCT plication W095/251116 
(BaldeschweUer et oL). In anotiier aspect, a "gridded" array analogous to a dot (or slot) blot 
may be used to arrange and link cDNA fragments or oUgonucleotides to the surface of a 
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substrate using a vacuum system, thermal, UV, mechanical or chemical bonding procedures. 
An array, such as those described above, may be produced by hand or by using available 
devices (slot blot or dot blot apparatus), materials (any suitable solid support), and machines 
(including robotic instruments), and may contain 8, 24, 96. 384, 1536 or 6144 
5 oligonucleotides, or any other number between two and over one million which lends itself to 
the efficient use of comma-dally-available instrumentation. 

Diagnostics using polypeptides or mRNA 

In addition to the methods discussed above, diseases may be diagnosed by methods comprising 
determining, from a sample derived from a subject, an abnormally decreased or increased level 
10 of polypeptide or mRNA. Decreased or increased expression can be measured at the RNA 
level using any of the methods well known in the art for the quantitation of polynucleotides, 
such as, for example, nucleic acid ampUfication, for instance PGR. RT-PCR. RNase protection. 
Northern blotting and other hybridization methods. 

Assay techniques that can be used to detennine levels of a polypeptide of the present invention 
15 in a sample derived from a host are well-known to those of skill in the art and are discussed in 
some detail above (including radioimmunoassays, competitive-binding assays, Western Blot 
analysis and EUSA assays). One example of this aspect of the invention provides a diagnostic 
method which comprises the steps of: (a) contacting a Ugand as described above with a 
biological sample under conditions suitable for the formation of a ligand-polypeptide complex; 
20 and (b) detecting said complex. 

Protocols such as EUSA, RIA, and FACS for measuring polypeptide levels may additionally 
provide a basis for diagnosing altered or abnoraial levels of polypeptide expression. Normal or 
standard values for polypeptide expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, preferably humans, with antibody to the 
25 polypeptide under conditions suitable for complex formation The amount of standard complex 
formation may be quantified by various methods, such as by photometric means. 

Antibodies which specifically bind to a polypeptide of the invention may be used for the 
diagnosis of conditions or diseases characterised by expression of the polypeptide, or in assays 
to monitor patients being treated with the polypeptides, nucleic acid molecules, ligands and 
30 otiier compounds of the invention. Antibodies usefiil for diagnostic purposes may be prepared 
in the same manner as those described above for therapeutics. Diagnostic assays for the 
polypeptide include mediods that utilise the antibody and a label to detect the polypeptide in 
human body fluids or extracts of cells or tissues. The antibodies may be used with or without 
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modification, and may be labeUed by joining them, either covalently or non-covalentiy, with a 
reporter molecule. A wide variety of reporter molecules known in the art may be used, several 
of wWch are described above. 

Quantities of polypeptide expressed in subject, control and disease samples from biopsied 
5 tissues are compared witii the standard values. Deviation between standard and subject values 
establishes the parameters for diagnosing disease. Diagnostic assays may be used to distinguish 
between absence, presence, and excess expression of polypeptide and to monitor regulation of 
polypeptide levels during tiierapeutic intervention. Such assays may also be used to evaluate 
the efficacy of a particular therapeutic treatment regimen in animal.smdies, in clinical trials or 
10 in monitoruig tiie treatment of an individual patient. 

Diagnostic kits 

A diagnostic kit of tiie present invention may comprise: 

(a) a nucleic acid molecule of the present invention; 

(b) a polypeptide of the present invention; or 
15 (c) a ligand of the present invention. 

In one aspect of die invention, a diagnostic kit may comprise a first container containing a 
nucleic acid probe tiiat hybridises under stringent conditions with a nucleic acid molecule 
according to tiie invention; a second container containing primers useful for amplifying the 
nucleic acid molecule; and instructions for using the probe and primers for faciUtating the 
20 diagnosis of disease. The kit may fiirther comprise a third container holding an agent for 
digesting unhybridised RNA. 

In an alternative aspect of the invention, a diagnostic kit may comprise an array of nucleic acid 
molecules, an array of antibody molecules, and/or an array of polypeptide molecules, as 
discussed in more detail above. 
25 Such kits will be of use in diagnosing a disease or susceptibility to disease, particularly 
inflammation, oncology, or cardiovascular disease. 

Various aspects and embodiments of the present invention will now be described in more 
detail by way of example, with particular reference to polypeptides regulated differentially 
under hypoxic conditions as opposed to normoxic conditions. It will be appreciated that 
30 modification of detail may be made witiiout departing firom the scope of die invention. 
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The method used in the following examples for the identification of hypoxia-induced genes 
and proteins utilised an approach herein termed "Smartomics". This method is a differential 
expression screening method for identifying genetic elements that are involved in a particular 
cellular process and is described in detail in a co-pending, co-owned International patent 
5 application PCT/GBOl/00758 entitled -Differential Expression Screening Method". The 
method involves conqjaring: 

(a) gene expression in a first cell of interest; and 

(b) gene expression in a second cell of interest, which ceU comprises altered levels, 
relative to physiological levels, of a biological molecule impUcated in the ceUular 

10 process, due to the introduction into the second cell of a heterologous nucleic acid 

directing expression of a polypeptide; and 
identifying a genetic element whose expression differs, wherein gene expression in said first 
and second cell of interest is compared under at least two different environmental conditions of 
oxygen concentration. 

IS Brief description of the Figures 

Figure 1: Northern blots performed to confirm overexpiession of HIF-la and EPASl using 
adenoviral gene transfer in transduced macrophages. RNA loading was as follows: Lanes 1,2: 
Macrophages transduced with the adenovirus AdApt ires-GFP. Lanes 3,4: Macrophages 
transduced with the adenovirus AdApt HIF-la-ires-GFP. Lanes 4,5: Macrophages transduced 

20 with the adenovirus AdApt EPASl-ires-GFP. In lanes 13^ the macrophages were maintained 
in normoxia (20% O2). In lanes 2,4,6 the macrophages were maintained in hypoxia (0.1% O2). 
Positions of bands fi:om an RNA size ladder are indicated to the right of each blot in Idlobases 
(kb). Hybridisation probes were complemoitary to the graes HIF-la (A), EPASl (B) and 28s 
ribosomal RNA (C). 

25 Figure 2: A scatter plot of two representative RNA samples analysed using Research Genetics 
GeneFilters. RNA from non-transduced macrophages in noimoxia (Y-axis) or hypoxia (X- 
axis) was hybridised to two Research Genetics GeneRlters GF200 arrays. Analysis was output 
as normaHsed intensity for each gene on the array, with two values per gene corresponding to 
the signals from normoxia and hypoxia. These values were plotted as a scatter graph, with each 

30 dot representing a gene on the array. Genes expressed at similar levels between the RNA 
samples are located at the x=y line. In this representation an indication is apparent of the 
dynamic range of detection. 
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Figure 3: Analysis of Lactate Dehydrogenase A expression with Smartomics. In section A. 
thumbnail images of spots corresponding to the lactate dehydrogenase-A (LDH-A) gene are 
shown. Contrast levels were set at a level to allow optimal visualisation of this gene, but are at 
a constant setting throughout this figure. Each strip of 6 images corresponds to a discrete array 

5 position or experiment, over the range of RNA samples. Figures beneath individual spot 
images are ratios of the normalised intensity of that spot compared to the reference condition 
(gfy; 20%O2). Array location: Identity of the spot as defined by Research Genetics. Qone: 
IMAGE identification. The histogram (section B) shows the average of the figures shown and 
error bars are standard deviation, g^: ceUs transduced with AdApt ires-GFP. Hif-la: Cells 

10 transduced with AdApt Hif-la-ires-GFP. Epasl: CeUs transduced with AdApt Epasl-ires- 
GFP. 

Figure 4: Analysis of Glyceraldehyde 3-phosphate dehydrogenase expression with 
Smartomics. In section A, thumbnail images of spots corresponding to the glyceraldehyde 3- 
phosphate dehydrx)genas6 (GAPDH) gene are shown. Contrast levels were set at a level to 

15 allow optimal visuaUsation of this gene, but are at a constant setting throughout fliis figure. 
Each strip of 6 images corresponds to a discrete anay position or experiment, over the range of 
RNA samples. Figures beneath individual spot images are ratios of the normalised intensity of 
that spot compared to the reference condition (gft); 20%O2). Array location: Identity of the spot 
as defined by Research Genetics. Qone: IMAGE identification. The histogram (section B) 

20 shows the average of the figures shown and error bars are standard deviation, gft): cells 
transduced with AdApt ires-GFP. Hif-la: Cells transduced with AdApt Hif-la-ires-GFP. 
Epasl : Cells transduced witii AdApt Epasl-ires-GFP. 

Figure 5: Analysis of Platelet derived growtii factor beta expression witii Smartomics. In 
section A, thumbnail images of spots corresponding to the Platelet derived growfli factor beta 

25 (PDGF Beta) gene are shown. Contrast levels were set at a level to allow optimal visuaUsation 
of this gene, but are at a constant setting throughout this figure. Each strip of 6 images 
conesponds to a discrete array position or experiment, over tiie range of RNA samples. Figures 
beneath individual spot images are ratios of the normalised intensity of that spot compared to 
the reference condition (gip; 20%O2). Array location: Identity of the spot as defined by 

30 Research Genetics. Clone: IMAGE identification. For this gene, it is noted that different 
IMAGE clones corresponding to the same gene are present. The histogram (section B) shows 
the average of the figures shown and error bars are standard deviation, g^: cells transduced 
with AdApt ires-GFP. Hif-la: Cells transduced with AdApt Hif-lotrires-GFP. Epasl: Cells 
transduced witii AdApt Epasl-ires-GFP. 
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Figure 6: Analysis of Monocyte Chemotactic Protein-1 expression with Smartomics. In 
section A. thumbnail images of spots corresponding to the Monocyte Chemotactic Protein-l 
(MCP-1) gene are shown. Contrast levels were set at a level to allow optimal visualisation of 
this gene, but are at a constant setting throughout this figure. Each strip of 6 images 

5 corresponds to a separate experiment, over the range of RNA samples. Figures beneath 
individual spot images are ratios of the normalised intensity of that spot compared to the 
reference condition (g^; mOH. Array location: Identity of the spot as defined by Research 
Genetics. Qone: IMAGE identification. The histogram (section B) shows the average of the 
figures shown and error bars are standard deviation, g^: cells transduced with AdApt ires- 

10 GFP. Hif-la: Cells transduced with AdApt Hif-la-ires-GEP. Epasl: CeUs transduced with 
AdApt Epasl-ires-GFP. 

Figure 7: Discovery of a novel gene (Hs.l6335) using Smartomics. In section A, thumbnail 
images of spots corresponding to the EST firom UniGene cluster Hs.l6335 are shown. Contrast 
levels were set at a level to allow optimal visualisation of this gene, but are at a constant setting 

15 throughout this figure. Vox this gene, contrast levels are at maximum. Each strip of 6 images 
corresponds to a separate experiment, over the range of RNA samples. Figures beneath 
individual spot images arc ratios of the normalised intensity of that spot compared to the 
reference condition (gfp; 20%O2). Array location: Identity of tiie spot as defined by Research 
Genetics. Qone: MAGE identification. The histogram (section B) shows the average of the 

20 figures shown and error bars are standard deviation, g^: cells transduced with AdApt ires- 
' GFP. Hif-la: Cells transduced witii AdApt Hif-la-ires-GFP. Epasl: Cells transduced witii 
AdApt Epasl-ires-GFP. 

Figure 8: Virtual Northern blot hybridisation to vaUdate discovery of Hs.l6335 by 
Smartomics. A) Hybridisation probe = Hs.l6335. B) Hybridisation probe = p actin. Lanes 1-6 
25 are the RNA samples used in Figures 3-7, fi-om cells transduced with adenovirus. Lanes 7-10 
are from non-transduced macrophages with Oanes 9,10) or without Oanes 7,8) prior activation. 
Histograms show relative mRNA expression levels, from phosphorimager analysis, relating to 
the Northern blots positioned above. Figures are relative expression ratios compared to gft) 
(20% O2). 

30 Figure 9: Plasmid map for pSMARTCMV-HIF. 
Figure 10: Plasmid map for pSMART CMV-empty. 
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EXAMPLES 

Example 1: The use of Smartomics for the identification of hypoxia-regulated genes in 
macrophages 

The Smartomics method has been utilised herein to improve the discovery of genes activated 
5 or repressed in response to hypoxia in primary human macrophages. This involves augmenting 
the natural response to hypoxia, by experimentally introducing a key regulator of the hypoxia 
response, namely hypoxia inducible factor la (fUF-la), into a population of primary human 
macrophages and comparing gene expression in these cells with that in control cells. 

Ovetexpression of HIF-la was done either in isolation or was done in combination with 
10 exposing the cells to hypoxia. This allowed the detection of resulting gene expression changes 
that would otherwise have not been detectable in response to hypoxia alone. 

Although HIF-la is well known to mediate responses to hypoxia, other transcription factors 
are also known or suspected to be involved. These include a protem called endothelial PAS 
domain protein 1 (EPASl) or HIF-2a, which shares 48% sequence identity with HIF-la 

15 ("Endothelial PAS domain protein 1 (EPASl). a transcription factor selectively expressed in 
endothelial cells." Tian H, McKnight SL. Russell DW. Genes Dev. 1997 Jan l;ll(l):72-82.). 
Evidence suggests that EPASl is especially important in mediating the hypoxia-response in 
certain cell types, and it is clearly detectable in human macrophages, suggesting a role in tiiis 
cell type (Griffiths et al. Ge,ie Ther. 2000 Feb;7(3):255-62). In the Ught of this, tiie current 

20 example also utilises overexpression of EPASl, as an independent means of improving 
discovery of hypoxia-responsive genes, to overexpression of HIF-la. It also illustrates an 
embodiment of the invention, whereby differences in the response to HIF-la or EPASl (or 
other mediators of the hypoxia response) may be identified, witii die goal of identifying 
therapeutic target molecules more suitable for specific and efficient treatment of disease. 

25 The introduction of foreign gene sequences (i.e. HEF-la or EPASl) to primary macrophages 
may be achieved by recombinant adenovirus. A commercially available system was used 
herein to produce adenoviral particles involving tiie adenoviral transfer vector AdApt, the 
adenoviral genome plasmid AdEasy and the packaging cell Une Per-c6 (Introgene, Leiden, The 
NetiiOTlands). The standard manufactiirK's instructions were followed. 

30 Three derivatives of the AdApt transfer vector have been prepared, named AdApt iics-GFP, 
AdApt HIF-la-ires-GFP and AdApt EPASl-ires-GFP. In these vectors, for convenience, 
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AdApt was modified such that inserted genes (i.e. fflF-la or EPASl) expressed from the 
powerful cytomegalovirus (CMV) promoter were linked to the green fluorescent protein (gfy) 
marker, by virtue of an internal ribosome entry site (ires). Therefore presence of green 
fluorescence provides a convenient indicator of viral expression of HEF-la or EPASl in 
5 transduced mammalian cells. ' 

Standard molecular biology methods were used to construct the derivatives of AdApt, which 
included reverse transcriptase PGR (RT-PCR). transfer of DNA fragments between plasmids 
by restriction digestion, agarose gel DNA fragment separation, "end repairing" double stranded 
DNA fragments with overhanging ends to produce flush blunt ends, and DNA ligation. 
10 Subcloning steps were confirmed by DNA sequencing. These techniques are weU known in the 
art, but reference may be made in particular to Sambrook et td.. Molecular Qoning, A 
Laboratory Manual (1989) and Ausubel et al.. Short Protocols in Molecular Biology (1999) 4th 
Ed, John Wiley & Sons, Inc. 

Briefly, AdApt ires-GFP was made by inserting the encephalomyocarditis virus EMCV ires 
15 followed by the green fluorescent protein gene (GFP), into the end-repaired Hpal restriction 
site of AdApt, immediately downstream of and in the same orientation as die CMV promoter. 
Botii EMCV ires and gl^ sequences are widely used and can be obtained from commonly 
available plasmids. SEQ ID NO:39 recites the exact nucleotide sequence of the joined ires- 
GFP which was inserted into the AdApt plasmid. 

20 The plasmid AdApt HIF-la-iies-GFP was derived from AdApt ires-GFP by inserting the 
protein coding sequence of human HIF-la between tiie CMV promoter and Ae ires-GFP 
elements of AdApt ires-GFP. To do this, human HIF-la cDNA was cloned by RT-PCR from 
human mRNA, and die sequence was verified by comparison to die pubUshed HIF-la cDNA 
nucleotide sequence (Genbank accession U22431). The HIF-la sequence was ligated as an 

25 end-repaired fragment into tiie end-repaired Agd restriction site of AdApt ires-GFP [this is 
also the Agel restriction site of the parental vector AdApt immediately downstream of the 
CMV promoter]. The exact DNA sequence containing HIF-la that was inserted into AdApt 
ires-GFP is shown in SEQ ID NO: 40. 

The plasmid AdApt EPASl-ires-GFP was derived from AdApt ires-GFP by inserting the 
30 protein coding sequence of human EPASl between the CMV promoter and the ires-GFP 
elements of AdApt ires-GFP. To do this, human EPASl cDNA was cloned by reverse 
transcriptase PCR (RT-PCR) from human mRNA. and the sequence was verified by 
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comparison to the published EPASl cDNA nucleotide sequence (GenBank accession U81984). 
The EPASl sequence was ligated as an end-repaired fragment into the end-repaired Agel 
restriction site of AdApt ires-GFP [this is also the Agel restriction site of the parental vector 
AdApt immediately downstream of the CMV promoter]. The exact DNA sequence containing 
5 EPAS 1 which was inserted into AdApt ires-GFP is shown in SEQ JD N0:41 

The adenoviral transfer vectors AdApt HIF-la-ires-GFP and AdApt EPASl-ires-GFP, were 
verified prior to production of adenoviral particles, for their ability to drive expression of 
functionally active HIF-la or EPASl protein from the CMV promoter in manunalian cells. 
This was achieved by transient transfection luciferase-reporter assays as described (Boast et 
10 al.. Hum Gene TJier. 1999 Sep l;10(13):2197-208). 

Using the aforementioned Introgene adenoviral system, caesium-banded, pure adenoviral 
particles were produced for each of the vectors AdApt ires-GFP, AdApt HIF-la-ires-GFP and 
AdApt EPASl-ires-GFP. Following the Introgene manual, adenoviral preparations were 
quantitated by spectrophotometry, yielding values of viral particles (VP) per miUiliter. 

15 To isolate human macrophage, monocytes were derived from peripheral blood of healtiiy 
human donors. lOQml bags of buffy coat from the Bristol Blood Transfusion Centre (Bristol, 
UK) were mixed with an equal volume of RPMI1640 medium (Sigma). This was layered on 
top of IQml ficol-paque (Pharmacia) in 50ml centrifuge tubes and centrifuged for 25 min at 
800 X g. The interphase layer was removed, washed in MACS buffer (phosphate buffered 

20 saline pH 7.2, 0.5% bovine serum albumin, 2mM EDTA) and rcsuspended at 80 microliter per 
lOn? cells. To tiiis, 20 microliter CD14 Microbeads (Miltenyi Biotec) were added, and the tube 
incubated at 4 degrees for 1 5 min. Following this, one wash was performed in MACS buffer at 
400 X g and the cells were tesuspended in 3 ml MACS buffer and separated on an LS-i- MACS 
Separation Column (Miltenyi Biotec) positioned on a midi-MACS magnet (Miltenyi Biotec). 

25 The column was washed with 3 x 3ml MACS buffer. The column was removed from flie 
magnet and cells were eluted in 5 ml MACS buffer using a syringe. Cells were washed in 
culture medium (AIM V (Sigma) supplemented with 2% human AB serum (Sigma), and 
resuspended at 2 x lOnS cells per ml in the same medium and placed in large teflon-coated 
culture bags (Sud-Laborbedarf GmbH, 82131 Gauting, Germany) and transferred to a tissue 

30 culture incubator (37 degrees, 5% CO2) for 7-10 days. During this period, monocytes 
spontaneously differentiate to maaophages. This is confirmed by examining cell morphology 
using phase contrast microscopy. Cells arc removed from the bags by placing at 4 degrees for 
30 min and emptying the contents. 
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The macrophages were washed and resuspended in DMEM (Gibco. Paisley. UK) 
supplemented with 4% fetal bovine serum (Sigma). 4x10^ ceUs were plated into individual 
10cm Primeria (Falcon) tissue culture dishes in a total volume of 8 ml per plate, with 6xl0' 
adenoviral particles per ml. Following culture for 16 hr, during which the macrophages adhere 
5 to the plate and are infected by the adenoviral particles, the medium is removed and replaced 
by AIM V medium supplemented with 2% human AB serum. A further 24 hr period of culture 
is allowed prior to experimentation, to allow gene expression from the transduced adenovirus. 

The above dosage of adenoviral particles was determined to be the minimum amount required 
to achieve transduction of the majority (over 80%) of the macrophage population, using green 
10 fluorescence as a marker of gene transfer. This was confirmed using a separate adenovural 
construct containing the LacZ reporter gene. By selecting the minimum dose of virus, possible 
non-specific effects of vural transfer are minimised. 

For experimentation with hypoxia, identical culture dishes were divided into two separate 
incubators: One at 37 degrees, 5% CO2, 95% air (=Normoxia) and the other at 37 degrees, 5% 
15 CO2, 94.9% Nitrogen, 0.1% Oxygen (=Hypoxia). After 8 hours culture under these conditions, 
the dishes were removed ftom the incubator, placed on a chiUed platform, washed in cold PBS 
and total RNA was extracted using RNazol B (Tel-Test, Inc; distributed by Biogenesis Ltd) 
following the manufacturer's instructions. 

The design of this experiment was to obtain six populations of cells (referred to for simplicity 
20 as "ceU types"), differing only in their treatment with adenovirus and/ or hypoxia, as shown 
below: 



"Cell Tvoe" 


Adenovirus 


Expressed peneOxveen condition 


1 


AdApt ires-GFP 


none 


Normoxia 


(20% Oxygen) 


2 


AdApt ires-GFP 


none 


Hypoxia 


(0.1% Oxygen) 


3 


AdApt HIF-lar4res-GFP 


HIF-la 


Normoxia 


(20% Oxygen) 


4 


AdApt HlF-la-ires-GFP 


HIF-la 


Hypoxia 


(0.1% Oxygen) 


5 


AdApt EPASl-ires-GFF 


EPASl 


Normoxia 


(20% Oxygen) 


6 


AdApt BPASl-ires-GFP 


EPAS.l 


Hypoxia 


(0.1% Oxygen) 
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Gene discovery can be implemented by comparing gene expression profiles between these 
"cell types". According to conventional methods available in the hterature, one would make 
comparisons between cell types 2 and 1. By implementing the Smartomics method, several 
other possibilities are seen. Firstly, a comparison can be made between cell types 3 or 5 and 

5 cell type 1. Here, the stimulus of overexpressing key molecules involved in the hypoxia 
response may exceed the natural response the hypoxia, as seen for cell type 2. Secondly, in a 
preferred embodiment of the invention, a comparison can be made between cell types 4 or 6 
and cell type 1. In this situation, the natural response to hypoxia is being augmented or boosted 
by overexpressing key molecules involved in the hypoxia response. It should be noted that the 

10 experimental design illustrated above uses a control adenovirus in place of untreated cells. By 
doing this, any non-specific effects of viral transduction should occur equally throughout the 
analysis, and will disappear. 

Although efficient adenoviral gene transfer was indicated by green fluorescence in the 
transduced macrophages. Northern blotting was used to absolutely confirm overexpression of 

15 HIF-laand EPASl. RNA samples extracted fi-om cell types 1-6 as described above were 
analysed by Northern blotting (Figure 1). The RNA samples (8ug total RNA per lane) were 
electrophoresed on a formaldehyde denaturing 1% agarose gel, then transferred to a nylon 
membrane (Hybond-N. Amersham, UK), and sequentially hybridised with ^^P-labelled DNA 
probes complementary in nucleotide sequence to HIF-la (Figure la), EPASl (Figure lb) or 

20 28S ribosomal RNA (Figure Ic). The methodology used for Northern blotting, probe 
hybridisation under stringpnt conditions, and removal of probes between hybridisations, is well 
known in the art. 

In Figure la, it can be seen that all lanes contain a faint band of approximately 4 kb, 
corresponding to the endogenous HIF-la mRNA. In lanes 3,4. which contain RNA ftom cells 
25 transduced with AdApt HIF-la-ires-GFP, a much stronger band of a similar size is observed, 
indicating successful overexpression of HIF-la. 

In Figure lb, it can be seen that all lanes contain a very faint band of approximately 5 kb. 
corresponding to the endogenous EPASl mRNA. In lanes 5,6, which contain RNA firom cells 
transduced with AdApt EPASl-ires-GFP, a much stronger band at approximately 4 kb is 
30 observed, indicating successfiil overexpression of EPASl. The difference in size of the 
endogenous and overexpressed EPASl is due to the long untranslated region of the 
endogenous gene, which is of no consequence. 
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In Fipre Ic, it can be seen that 28S ribosomal RNA is detected in all lanes, indicating equal 
loading of RNA on the ^1. 

By phosphorimager quantitative analysis of Figures la and lb it is apparent that 
overexpression levels of both HIF-la and EPASl are approximately 80-fold over the 
5 endogenous levels. Adenoviral-directed mRNA overexpression of these genes is not further 
augmented by hypoxia. For example, in Figure la, the band intensity for lane 4 does not 
exceed that for lane 3. However at Ae protein and functional levels, hypoxia potentiates the 
action of the proteins encoded by these mRNAs (Semenza GL. Armu Rev Cell Dev Biol. 
1999;15:551-78. "Regulation of mammalian 02 homeostasis by hypoxia-inducible factor 1"). 
10 Global mRNA expression profiles from the RNA samples isolated from the six "ceU types- 
were obtained using Research Genetics Human GeneFdters Release 1 (GF200) (Research 
Genetics, HuntsviUe, AL). This method uses pre-made arrays of DNA complementary to 5,300 
genes covering a range of levels of characterisation, including sequences which only match 
unannotated ESTs or cDNA sequences of unknown function. 
15 The arrays are nylon in composition, and are spotted with DNA derived from specific IMAGE 
consortium cDNA clones (http-7/image.llnl.gov/image/). The arrays are hybridised to RNA 
samples which have been radioactively labelled with the isotope to measure the abundance 
of individual genes within the RNA samples. Multiple RNA samples arc labelled and 
hybridised in parallel to separate copies of the array, and spot hybridisation signals are 
20 compared between the RNA sanq)les. 

Key issues in array-based mRNA expression analysis are sensitivity and reliability. Currently 
two other methods are available; glass microarrays and DNA chips, both of which utilise 
fluorescently labelled RNA (Bowtell DD. Nat Genet 1999 Jan;21(l Suppl):25-32. '-Options 
available-from start to finish-for obtaining expression data by microarray."). Although these 
25 methods are often believed to offer increased sensitivity over Nylon-based methods, this beUef 
lacks definitive proof. To the contrary, a carefid comparison of the three approaches shows that 
for similar amounts of unampUfied RNA, the nylon-based radioactive method is superior 
(Bertucci F, Bernard K, Loriod B, Chang YC, Granjeaud S, Bimbaum D, Nguyen C, Peck K, 
Jordan BR. Hum Mot Genet. 1999 Sep;8(9):1715-22. "Sensitivity issues in DNA array-based 
30 expression measurements and performance of nylon microarrays for small samples.")- The 
microarray and DNA chip methods require much larger amounts of RNA which are often not 
easily obtained from primary cells, or complicated amplification methods, which are liable to 
introduce error. 
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To demonstrate the sensitivity of the array-based gene expression method used in the cwrent 
exemplification of Smartomics, a scatter plot of two representative RNA samples analysed in 
our laboratory using Research Genetics GeneFilters. demonstrates a range of detection 
approaching 4.1ogs (Figure 2). By comparison, arguably the most sophisticated array-based 
5 method, the DNA chip, is quoted as having a range of detection of 3-logs (Affymetrix). 

Therefore, it is reasonable to assume that the improvements afforded by Smartomics regarding 
sensitivity issues, as Ulustrated by the current exemplification, could not easUy be obtained by 
utilising an alternative array-based method. In any case, any potentially superior array 
methodology could be fimher improved by utilising the Smartomics invention described here. 
10 RNA extracted ftom the 6 "ceU types" as described above, was ladioactively labelled and 
hybridised to separate copies of the Research CJenetics Human GeneFilter GF200 (experiment 
#1). Methods provided by the manufacturer were followed 
(http://wwwjesgen.com/products/GF200_protocol.php3). Images of hybridised arrays were 
obtained using a Molecular Dynamics Storm phosphorimager. RNA was then stripped from the 
1 5 arrays, following the aforementioned protocol. 

To ensure reproducibility, this procedure was repeated with the same RNA samples 
(experiment #2). The entire data set was tiien imported and analysed using Research Genetics 
Pafliways 3.0 software, as explained in tiie Pathways 3.0 manual. Key aspects of the current 
analysis are summarised bdow: 
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Project Tree set-up 

"Condition Pairs" mode was used to simultaneously analyse multiple experiments. "Condition" 
means several arrays hybridised to similar RNA samples, derived from the same "cell type". 





Condition 


"Cell Type" 


Adenovirus 


Oxygen 


Experiment 


5 


1 


1 


AdAptires-GFP 


Normoxia 


1 




1 


1 


AdAptires-GFP 


Normoxia 


2 




2 


2 


AdAptires-GFP 


Hypoxia 


1 




2 


2 


AdAptires-GFP 


Hypoxia 


2 




3 


3 
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Normalisation seuup 

The "all data points" option and Y. Chen algorithm with default settings were selected, as 
20 explained in the Pathways 3.0 manual. The two experiments were treated as separate 
normalisation groups, such that global differences between hybridisation signals from different 
arrays from the same experiment were corrected. 

Comparison analysis 

Pair-wise comparisons were made between condition 2 and condition 1 
25 condition 3 and condition 1 

condition 4 and condition 1 
condition 5 and condition 1 
condition 6 and condition 1 

30 In otiier words, pair-wise comparisons were made using condition 1 (i.e. cell type 1) as the 
reference condition. This corresponds to cells transduced witii the control adenovirus AdApt 
ires-GFP and placed under normal oxygen concentration (normoxia). Comparisons are made in 
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this way for all genes present on the Research Genetics GF200 airay. By comparing 
conditions, the analysis considers data from bofe experiments #1 and #2. 

Filter settings 

Filtering was then done to select genes with expression ratios of above 2.0 for at least one of 
5 the five pair-wise comparisons detailed above. Genes with low signal intensities for all of the 
six conditions were automatically eliminated, using an Intensity E filter of min 0.2, max 1000. 
Genes that did not respond in a reproducible way in experiment #1 and #2, were automatically 
eliminated using the Students t-test filter (90% confidence level). 

Results were output as expression profiles of individual genes, showing normalised signal 
10 intensity and expression ratio. A key advantage of analysis in Pathways 3.0 is that high 
magnification thumbnail images of individual spots are displayed. This allows visual 
verification that the area being measured truly covers the region containing the hybridised 
array spot, and that the spot is real and not a background artefect. 

Minor differences between quantitative data and correspontting thumbnail images are 
15 sometimes seen even though the sampled area is clearly the bona fide array spot For example, 
by eye there might seem to be a small difference between two spots, though the quantitative 
analysis might suggest a larger difference. It should be noted that thumbnail images are not 
normalised to compensate for global differences, and are limited in image quality. Greyscale 
images are inherently limited in their capacity to depict quantitative differences in intensity. 
20 Digital images generated by the Storm phosphorimager cover a linear dynamic range of 
100,000 for a single pixel, whereas printed images can only be depicted as 256 shades of grey. 

Results for three representative knovra hypoxia-regulated genes 

As demonstration that overcxpression of HIF-la or EPASl together with hypoxia exposure is 
superior to using non-transduced hypoxic cells, in terms of discovering bom fide hypoxia- 
25 regulated genes, results are shovm for genes which are already known in the art to be regulated 
in hypoxia. 

Three genes have been selected which are represented as double spots on the Research 
Genetics GF200 array. Therefore, because the whole experiment was repeated, a total of four 
repeat comparisons are possible for these genes. 
30 The lactate dehydrogenase A (LDH-A) gene is known in the art to be activated by hypoxia 
(Webster KA. Mol Cell Biochem. 1987 Sep;77(l): 19-28. "Regulation of glycolytic enzyme 
RNA transcriptional rates by oxygen availability in skeletal muscle cells."). In Figure 3, it can 
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be seen that in response to hypoxia alone (gfp 0.1% Ch) there is on average a 2.24-fold 
increase in mRNA expression compared to normoxia (g^ 20% O2). 

By oveiexpressing HIF-la there is on average a 3.39-fold increase in LDH-A expression, 
providing a significant improvement over the natural response (Figure 3; HIF-la 20% O2). By 
5 utiUsing a preferred embodiment of the Smartomics method, and simultaneously 
overexpressing HF-la in the presence of hypoxia, the average response of LDH-A is elevated 
further to 4.50-fold (Figure 3; HIF-la 0.1% O2). 

In the prior art it has been established that HIF-la is responsible for mediating the hypoxia- 
induced activation of LDH-A Oyer NV, Kotch LE, Agani F, Leung SW. Laughner E, Wenger 

10 RH, Gassmann M, Gearhart JD, Lawler AM, Yu AY, Semenza GI. Genes Dev. 1998 Jan 
15;12(2): 149-62 "Cellular and developmental control of 02 homeostasis by hypoxia-inducible 
factor 1 alpha."). However it has never been envisaged or demonstrated that overexpression of 
HIF-la in a stable manner using viral gene transfer techniques, both with or without 
simultaneous hypoxia, causes secondary changes in gene expression which are niarkedly 

15 greater than the natural hypoxia response. The response to hypoxia of LDH-A is also improved 
by overexpressing EPASl (Figure 3; EPASl), though this is less dramatic than overexpressing 
HIF-la. 

Like LDH-A, the glyceraldehyde 3-phosphate dehydrogenase (GAPDH) gene is known in the 
art to be activated by hypoxia (Webster KA. Mol Cell Biocliem. 1987 Sep;77(l): 19-28. 
20 "Regulation of glycolytic enzyme RNA transcriptional rates by oxygen availability in skeletal 
muscle ceUs."). In Figure 4, it can be seen that in response to hypoxia alone (g^ 0.1% O2) 
there is on average a 1 .52-fbld increase in mRNA expression conqjared to normoxia. 

By overexpressing HIF-la there is on average a 3.33-fold increase in GAPDH expression, 
providing a significant improvement over the natural response (Figure 4; HIF-la 20% O2). By 
25 utilising the fall embodiment of the Smartomics method, and simultaneously overexpressing 
HDF-lam the presence of hypoxia, the average response of GAPDH is elevated further to 
4i7-fold (Figure 4; HIF-la 0.1% O2). 

In the published literature, it has been established that HIF-la is responsible for mediating the 
hypoxia-induced activation of GAPDH Oyer NV, Kotch LE, Agani F, Leung SW, Uughner E, 
30 Wenger RH, Gassmann M. Gearhart JD, Lawler AM, Yu AY. Semenza GL. Genes Dev. 1998 
Jan 15;12(2):149-62 "Cellular and developmental control of 02 homeostasis by hypoxia- 
inducible factor 1 alpha/'). However in die art, it has never been envisaged or demonstrated 
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that overexpression of HIF-lain a stable manner using viral gene transfer techniques, both 
with or without simultaneous hypoxia, causes secondary changes in gene expression which ate 
markedly greater than the natural hypoxia response. 

For GAPDH, it can be seen that overexpression of EPASl (Figure 4; EPASl 20% O2 and 0.1% 
5 O2), has a significantly smaller effect than overexpressing HIF-la. This demonstrates a 
separate embodiment of the Smartomics method, whereby genes are identified which respond 
selectively or preferentially to overexpression of EPASl or HIF-la. 

Platelet derived growth factor beta (PDGF p) is also known in the art to be activated by 
hypoxia (Kourembanas S, Hannan RL, FaDer DV. / Clin Invest. 1990 Aug;86(2):670-4 
10 "Oxygen tension regulates the expression of the platelet-derived growth factor-B chain gene in 
human endothelial cells."). In Figure 5. it can be seen that in response to hypoxia alone {gfp 
0.1% O2) there is on average a 2.14-fold increase in mRNA expression compared to normoxia. 

By overexpressing EPASl, there is on average a 9.28-fold increase in PDGF p expression 
(Figure 5; EPASl 20% O2). providing a large improvement over the natural response. In this 
15 case, the combination of hypoxia and EPASl overexpression does not exceed the response of 
EPASl overexpression alone, indicating saturation of die dose-response (Figure 5; EPASl 
0.1% O2). 

From Figure 5, it is clear that there is a striking specificity in the response of PEXjF p to 
EPASl and HIF-la, in the opposite manner observed for GAPDH. Overexpression of HIF- 
20 la alone has no significant effect on ?DGP p, whereas overexpression of EPASl produces 
large effects. This demonstrates a separate embodiment of the Smartomics method, whereby 
genes are identified which respond selectively or preferentially to overexpression of EPASl or 
HIF-la 

The gene encoding monocyte chemotactic protein 1 (MCP-1) is known in the art to respond to 
25 hypoxia in a negative fashion, by decreasing mRNA expression (Negus RP, Turner L, Burke 
F, Balkwill FR. / Leukoc Biol 1998 Jun;63(6):758-65. "Hypoxia down-regulates MCP-1 
expression: impUcations for macrophage distribution in tumors"). In Figure 6 it can be seen 
that in response to hypoxia alone (gfp 0.1% O2) there is on average a 0.407-fold change (i.e. a 
2.46 fold decrease) in mRNA expression compared to normoxia. 

30 By overexpressing HIF-la, there is on average a 0.243-fold change (i.e. a 4.11-fold decrease) 
in MCP-1 expression, provicKng a significant improvement over the natural response (Rgurc 6; 
HIF-la 20% O2). By utilising a preferred embodiment of the Smartomics method, and 
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Simultaneously overexpressing HIF-lain the presence of hypoxia, the average response of 
MCP-1 is further improved to a 0.112-fold change (i.e. an 8.93-fold decrease) (Figure 6; HIF- 
laO.1% O2). Even more pronounced improvements in the hypoxia-induced inhibition of 
MCP-1 expression are obtained by overexpressing EPASl (Figure 6; EPASl 20% O2 and 
5 0.1% O2). This demonstrates a use of Smartomics to improve the discovery of genes that are 
inhibited or repressed by disease signals. 

The finding that overexpressing HIF-laor EPASl potentiates hypoxia-induced gene 
repression, as exempUfied by MCP-1, is totally without precedent in this field. The structure of 
both HIF-la and EPASl proteins is that they contain transactivation domains but not known 
10 transcriptional repressor domains (Pugh CW, O'Rourke JF. Nagao M, Gleadle JM, RatcUffe PJ. 
J Biol Chem. 1997 Apr 25;272(17): 11205-14. "Activation of hypoxia-inducible factor-l; 
definition of regulatory domains within the alpha subunit.")- 

The results explained above relate to an array gene expression analysis, in which over 50 genes 
were identified as being regulated in hypoxia, from a total set of approximately 5300 genes on 

15 the array. By focusing on genes known in the art to be regulated in hypoxia; and showing how 
the Smartomics method can significantly enhance the response, an argument is provided that 
Smartomics would provide an improved method for the identification of novel bona fide 
hypoxia-regulated genes. In tiie current study, this can also be shown directly, for novel genes 
which were discovered using the Smartomics method, as presented below. Because expression 

20 changes arising from a conventional analysis are also covered in tiiis analysis (i.e. hypoxia / 
normoxia comparisons vwthout viral overexpression), the advantage of the Smartomics 
invention is cleariy demonstrated. 

Table 1 Usts unannotatcd genes or ESTs which were identified in tiiis analysis as being 
activated in response to viral-directed overexpression, but which would not have been 

25 identified from a hypoxia / normoxia comparison as done in the prior art. The final five 
columns of Table 1 show expression ratios compared to cells transduced with AdApt-ires-GFP 
in normojcia. The first of tiiese five columns is the response without Smartomics, and in all 
cases shown here, tiie levels are below significance. The otiier four colunms represent results 
obtained using flie Smartomics metiiod, and significant responses arc seen here. In particular, 

30 in the final rows of tiiis table, novel genes are identified which show large responses to EPASl 
overexpression. 
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Column 1 is the gene title as used in the UniGene database on 16 Feb 2001. Nucleotide and 
protein acessions are fcom the Genbank database. The final five columns show expression 
levels expressed as a ratio compared to cells transduced with AdApt ires-GFP in normoxia. gfp 
H: Expression in cells transduced with AdApt ires-GFP in hypoxia. Hif N: Expression in cells 
5 transduced with AdApt Hif-la-ires-GFP in normoxia. Hif H: Expression in ceUs transduced 
with AdApt Hif-la-ires-GFP in hypoxia. EPAS N: Expression in ceUs transduced with AdApt 
Epasl-ires-GFP in normoxia. EPAS H: Expression in cells transduced with AdApt Epasl-ires- 
GFP in hypoxia. 

Figure 7 shows the expression profile of one of these genes, conespondmg to an EST 
10 (GenBank accession N64734; IMAGE clone 293336). In the UniGene EST database 
(http://www.ncbi.nhn.nih.gov/UniGeneO this EST is currently clustered with only two other 
ESTs with accessions AI051607 (MAGE 1674154) and T87161 (MAGE 293336). The 
UniGene cluster number is Hs.l6335, and it is totally unannotated in the database. Sequence 
analysis shows that this rare sequence is incomplete and lacks information on the protein 
15 coding sequence. In the Ensembl database of human genome project gene annotation 
(http://www.ensembl.orgO blast searches of predicted or confirmed cDNA sequences do not 
identify this EST. It is therefore apparent that from public domain information, die gene 
corresponding to EST MAGE 293336, is a truly novel and unannotated gene. 
In Rgure 7, thumbnaU array spot images are shown at maximal contrast, such that the 
20 background signal is apparent It can be seen that in response to hypoxia alone (gfp 0.1% O2) 
there is on average a 1.4-fold increase in mRNA expression compared to normoxia. However, 
this is not significant, because it is derived from widely different ratios from individual 
experiments (2.41 and 0.46). From the thumbnail images for gft) 20% O2 and g^p 0.1% O2 it is 
evident that expression of the genes under these conditions is below the detection threshold of 
25 the array-based method. However, when the Smartomics invention is used, and EPASl is 
overexpressed using viral gene transfer methods, a clearly detectable response in seen, with 
induction ratios of over 8-fold (Figure 7; EPASl 20% O2 or 0.1% O2). The expression profile 
in Figure 7 also demonstrates a separate embodiment of Smartomics, for the identification of 
genes which respond selectively to HIF-la or EPASl. 
30 To confirm the results presented in Figure 7, a more sensitive method was used to study 
expression of the gene corresponding to MAGE clone 293336. namely virtual Northern 
blotting. It should be noted that this metiiod would not have been suitable for the original 
discovery that MAGE clone 293336 is induced by hypoxia, because virtual Northern blotting 
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and similar methods do not allow simultaneous screening of large numbers of genes. The 
technique is similar to conventional Northern blotting, with the exception that double stranded 
cDNA corresponding to the mRNA population of expressed genes is resolved by 
electrophoresis and blotted onto a nylon membrane. It reUes on a method of cDNA synthesis 

5 which produces fidl length cDNA molecules, which is commercially available (SMART PCR 
cDNA Synthesis Kit; Clontech Laboratories Inc. Palo Alto, CA, USA). 
The method for virtual Northern blotting was followed as described in the instruction manual 
for the SMART PCR cDNA Synthesis Kit. Briefly, 600ng cDNA was synthesised from the six 
RNA samples used for array hybridisation. An additional four RNA samples were also 

10 processed, derived from non-transduced macrophages cultured in nonnoxia and hypoxia (6 
hours at 0.1% O2) both with and without pre-treatment for 16 hours with 100 ng/ml 
Upopolysaccharide (£.coK 026:B6 Sigma, UK) and 1000 u/ml human gamma interferon 
(Sigma, UK). This combination of factors causes macrophage activation, a process key to the 
physiological and pathophysiological actions of the macrophage. All 10 cDNA samples were 

15 resolved on an agarose gel, and alkali transfer onto Hybond N+ membrane 
(AmershamPharmacia, UK) was carried out according to the Hybond N+ instructions. 
Stringent hybridisations with "P-labelled cloned cDNA probes were performed as for standard 
Northern blot hybridisation, which is well known in the art. cDNA probes were radiolabelled 
using a commercially available kit (Prime-a-Gene, Promega, UK). The virtual Northern blot 

20 was hybridised first with the cDNA insert of IMAGE clone 1674154 from UniGene cluster 
Hs.16335 (Figure 8a). The blot was then stripped, by a high temperature / low salt wash, and 
was re-probed with the protein coding region of the human p-actin gene (Figure 8b). 

From Figure 8a, it can be seen tiiat the mRNA corresponding to Hs.16335 is detected as a 
doublet band of approximately 4.5 kb. This gene is strongly induced by adenoviral-directed 

25 overexpression of EPASl Qanes 5,6), consistent with the array data from Figure 7. The higher 
induction ratios in this non-array analysis are due to increased sensitivity afforded by the 
virtual Northern technique. Unlike the array data, expression of Hs.16335 is within the range 
of detection for all RNA samples. Importantly, hypoxia alone is seen to cause an induction 
ratio of approximately 60-fold (Figure 8a; lanes 2, 8). Therefore Hs.16335 is identified as a 

30 bone fide hypoxia-regulated gene, despite being beneath the detection level of an array screen 
in the absence of the Smartomics method used hraein. 

The results in Figure Sa also demonstrate a separate embodiment of the Smartomics method, 
whereby genes are identified which respond selectively or preferentially to overexpression of 
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EPASl or HIF-la. Overexpression of HIF-la causes an induction ratio of 18.9-fold Gane 3), 
whereas overexpression of EPASl causes a much larger induction ratio of 141-fold (lane 5). 
In Figure 8a lane 9, it is shown that activation of macrophages by LPS and TNFa causes a 
10.8-fold increase in expression of the gene corresponding to Hs.l6335. Therefore tins novel 
5 gene is possibly relevant to the inflammatory functions of macrophages. 

In Figure 8b expression of tiie human ^-actin gene is found to be roughly constant tiiroughout 
this experiment, consistent with die differences in Figure 8a being due to specific changes in 
gene expression. 

Rapid amplification of cDNA ends (RACE) has been performed to clone the fuU length 
10 version of die gene corresponding to Hs.l6335, based on the size of the cDNA size on the 
virtual Northern blot Sequencing and functional analysis of this gene will possibly lead to the 
identification of a new therapeutic target molecule. Crucial to this process was the initial use of 
the Smartomics invention. 

Confirmation of targets using hybridisation assays 
15 To confirm the findings presented in Example 1, the IMAGE clones identified from the 
Research Genetics Human GeneFilters have now been fabricated by tiie authors into an 
independentiy produced and verified gene array (referred to herein as the "custom gene 
array"), composed of PCR-amplified insert DNA. The methods used to produce this array are 
common in die art, but the key points are sununarised below. 
20 IMAGE clones were obtained from the UK MRC HGMP Resource Centre (Hinxton, 
Cambridge CBIO ISB, UK) and were re-isolated as individual colonies and sequenced to 
verify the correct identity of tiie clone. In the majority of cases, die same IMAGE clone 
identified from die Research Genetics Human GeneFilters was selected, but in some instances 
these clones were not available and alternatives were selected, corresponding to the same gpne. 

25 Additional genes, with well-defined roles in various disease processes relevant to hypoxia, 
were also represented on die array, as derived from IMAGE clones. It is well established in the 
literature that genes widi similar functions are often co-regulated at the mRNA level, as 
determined by microarray data clustering methods ayer VR et al. Science. 1999 283(5398):83- 
7; Eisen MBetd Proc Natl Acad Sci USA. 1998 95 (25): 14863-8). This allows associations to 

30 be made between genes of unknown function (as present in die current specification) to genes 
of well defined function, in order to add significance to die former. 
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Normalisation is a key issue in airay analysis. The custom gene array is a single colour type 
airay. and contains a selection of additional MAGE clones corresponding to genes which were 
empirically determined not to be affected by hypoxia and which are highly expressed in a wide 
range of human tissues and cell types. During data analysis, spot intensities were divided by 
5 the mean of all the reference genes shown below, each of which was present in quadruplicate 
on each array. 

g^jjg IMAGE clone Acc 

FUl 1 102 fis clone PLACE1005646 AA464704 

10 matrix Gla protein . AA155913 
guanine nucleotide binding protein alpha stimulating 1 R43581 
DKFZp434A1319 W74725 
CDNAFU23280 fis clone HEP07194 AA669443 
betaactin (in house clone) 

15 EFla-like protein AI817566 
ribosomal protein L37a W91881 

IMAGE clone plasmid miniprep DNA was prepared and PGR amplified with flanking vector 
primers of the sequences GTTTTCCCAGTCACGACGITG and 
TGAGCGGATAACAATTTCACACAG. This was then purified and concentrated by ethanol 
20 precipitation, and the presence of a single band and DNA concentration were determined by 
agarose gel electrophoresis and by digital imaging methods. 

Purified PGR product corresponding to all the clones (IMAGE and non-IMAGE) were 
normalised to 0.5 mg/ ml by dilution. Arrays were fabricated onto Hybond N+ (Amersham) 
membranes using a BioRobotics TAS anayer (Biorobotics. Cambridge CB37LW. UK) with a 
25 500 micron pin tool. Using 384-weU source plates and a 2x2 arraying format this array was 
relatively low density, thereby eliminating problems of spot-to-spot signal bleed. Also the 
large pin size and high source plate DNA concentration improves the sensitivity of detection. 
Post-arraying denaturation/ neutralisation was essentially as described by Bertucci F et al.. 
1999 {Oncogene 18: 3905-3912). 

30 Total RNA was extracted from ceUs using RNeasy (Qiagen) and 7 micrograms RNA was 
labelled with 100 microCi 33P dCTP using 2 micrograms poly dT (10-20 mer) as primer in a 
reverse transcription reaction. First strand RNA was then degraded under alkaline contitions. 
and this was then neutralised with Tris HCl pH 8.0. and the labelled cDNA was purified using 
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BioRad BioSpin-6 chromatography columns. Pre-hybridisation was performed in 4 ml 
Research Genetics MicroHyb solution supplemented with lOmicrograms poly dA (10-20 mer) 
and 10 micrograms Cot-1 DNA, at 45 degrees for 2-3 hours. The cDNA was then denatured by 
heating and added to the pre-hybridisation, which was continued for 18-20hr. Washing steps 
5 were done as foUows: 2xSSC/ 1% SDS 2x20min at 50 degrees and 0.5xSSa 1% SDS lOmin 
at 55 degrees. Airays were exposed to Amersham Low Energy phosphor screens for 24hr and 
scamied using a phosphorimager at 50 micron resolution. Image analysis was done using 
ArrayVision software. (Imaging Research Inc). Tab delimited data files were exported and a 
full analysis performed using GeneSpring software (Silicon Genetics). 
10 Using the described methodology a dynamic range of detection of 4 logs and a sensitivity of at 
least 1 / 50,000 is obtained, as detennined by spike doping titration experiments. Having 
several technical differences compared to the Research Genetics Human GeneFilters as used in 
the initial filing, data ftom the custom gene array is expected to be quantitatively different. 
As explained earUer, hypoxia may be associated witii several inflammatory conditions, 
15 including rheumatoid arthritis. Moreover, other diseases associated witii hypoxia, including 
cancer and atherosclerosis are accepted as having important inflammatory aspects and hypoxia 
itself can modulate immune cell functions of the macrophage (Uwis IS et aL, 1999 
ILeukocyte Biol. 66: 889-900). As cytokines are central to tiie regulation of immune 
processes, the utility of hypoxia-regulated genes regarding inflammatory conditions may be 
20 supported if those genes also respond to key cytokines. In a series of experiments, primary 
human macrophages, cultiired as described, were treated with recombinant human cytokines at 
100 ng/ml, and gene expression changes were determined. In the table below, expression 
values are shown corresponding to tiie fold change compared to untreated cells (all in 
normoxia). Values of less than 1 .0 represent decreased gene expression. 
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2.22 


0.88 


1.04 


0.94 


1.11 


1.07 


O.o2 


U.oJ 


plL13 


19 


N80371 


1.69 


1.52 


2.22 


1.60 


0.82 


0.92 


1.03 


1.27 


1.13 


1.00 


1.29 


A OC 


1.5j 


pi© 


21 


R09498 


0.68 


0.98 


0. It 




1 IS 


iO 




1.08 


1.07 


0.85 


1.08 


0.77 


0.91 


plLlS 


29 


R06745 


0.92 


0.83 


0.81 


2.83 


1.46 


1.61 


1.08 


0.84 


1.32 


1.10 


0.99 


0.85 


1.05 


plM14 


33 


185201 


0.79 


0.83 


0.84 


1.73 


1.25 


1.74 


0.88 


O.70 


1.W 


0.78 


0.71 


0.71 


0.78 


plM2 


3 


R02569 


0.89 


0.77 


0.68 


^67 


1.14 


1.29 


0.92 


0.39 


\M 


1.02 


0J5 


o.6e 


0.64 


plL5 


1 


R11658 


0.6S 


0.62 


0.6S 


1.49 


0.63 


0.94 


1.0f 


0.8f 


Q5^i 


0.5S 


1 0.53 


0.68 


0.78 


plI6 


23 


N74648 


0.8f 


0.5' 


0.6i 


; I.IS 


0.6i 


0.91 


1.2( 


0.8i 


0.8: 


\ O.S'! 


0.54 


0.65 


) 0.79 



We present below that in additional experiments two of the genes in particular have expression 
proffles supporting a novel role in the response to inflammatory stimuli. These genes are 

5 encoded by IMAGE clone accession N80371 (Seq ID:19/20; clone ID:plL13) and by IMAGE 
clone accession N64734 (Seq ID:31/32; clone ID:plM13). The response of macrophages to a 
combination of Upopolysaccharide (LPS; E.coU 026:B6 lOOng/ml from Sigma) and gamma 
interferon (IFNr. lOOng/ml) induces particularly potent cell activation. Conversely, exposure 
to the cytokine IL-10 causes de-activation. To investigate the effects of these cytokines on 

10 macrophages in mote detail, expression changes were determined in both hypoxia and 
normoxia, with and without these cytokine combinations for a period of 6 hours. 
IMAGE clone accession N64734 (Seq ID:31/32; clone ID:plM13) showed the following gene 
expression changes, compared with untreated cells: 

A) in response to hypoxia, gene expression was increased 4.38-fold. 

15 B) in response to LPS/IFNy in normoxia, gene expression was not significantly changed 

C) in response to LPS/IFNy in hypoxia, gene expression was increased 3.33-fold. 

D) in response to IL-10 in normoxia, gene expression was increased 2.16-fold 

E) in response to IL-1 0 in hypoxia, gene expression was increased 1 1 .4-fold 
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This profile indicates responsiveness to hypoxia or to IL-10 alone, and a synergistic effect 
when both stimuli are combined, but no response to LPS/IFNy . 

IMAGE clone accession N80371 (Seq K):19/20; clone ID:plL13) showed the following gene 
expression changes, compared with untreated cells: 
5 A) in response to hypoxia, gene expression was increased 2.85-fold. 

B) in response to LPS/IFNt in nonnoxia, gene expression was not significanUy changed 

C) in response to LPS/IFN7 in hypoxia, gene expression was increased 3.00-fold. 

D) in response to IL-1 0 in normoxia, gene expression was increased 1.31-fold 

E) in response to IL-1 0 in hypoxia, gene expression was increased 3.11-fold 

10 Superoxide radicals arc frequently found at sites of inflammation and can be generated in the 
laboratory using a combination of Xanthine oxidase (10 mU/ml) and Xanthine (0.5 mM). 
Following treatment of macrophages for 18hr, IMAGE clone accession N64734 (Seq 
10:31/32; clone DD^IMIS) was induced in expression by 3.55-fold, and IMAGE clone 
accession N80371 (Seq ID:19/20; clone ID:plL13) was induced in expression by 2.0-fold. 

15 Additionally. IMAGE clone accession N64734 (Seq ID:31/32; clone ID:plM13) was induced 
in expression in response to 0.5 mM H2O2 by 1.63-fold. 
Example 2: Generation of codon-optimised EIAV vector expressing HIFl-a 
This example describes the generation of an EIAV-derived vector, pSMART CMV-HIF in 
which expression of HIF-la is driven from a CMV promoter located intemaDy within the 

20 vector (Figure 9). A similar vector backbone could be used to achieve expression of other 
genes for the purposes of differential screening as described in this patent. 
The starting point for construction of pSMART CMV-HIF was pONY4.0Z (WO 99/32646) 
and Mitophanous et al.. Gene Ther. 1999 Nov;6(ll): 1808-1 8. In the first step, plasmid 
pONY4.0Z was converted into pONYS.OZ by introducing mutations which 1) prevented 

25 expression of TAT by creating an 83nt deletion in exon 2 of tat, 2) prevented 82 ORF 
expression by a 51nt deletion, 3) prevented REV expression by deletion of a single base within 
exon 1 of rev, and 4) prevented expression of the N-terminal portion of gag by insertion of T 
residues within the first and second ATG codons of the gag region, thereby changing the 
sequence to ATTG fit>m ATG. With respect to the wild type EIAV sequence (Accession No. 
30 U01866) these correspond to deletion of 1) nt 5234-5316 inclusive, 2) nt 5346-5396 inclusive, 
and 3) nt 5538. The insertion of T residues (4)) was after nt 526 and nt 543. These alterations 
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were carried out using techniques readily practicable to one skilled in the art The resulting 
vector. pONYS.OZ expresses none of the EIAV accessory proteins or any of the EIAV gag 
protein. 

In the next step, the p-galactosidase reporter gene present in pONY8.0Z was replaced by the 
5 enhanced green fluorescence protein (eGFP) reporter gene to create pONYSG. This was done 
by transfeiTing the SacH -Kpnl fragment corresponding to the GFP gene and flanking 
sequences from pONY2.13GFP (WO 99/32646) into pONYS.OZ cut with the same enzymes. 
The presence of sequences termed the centtal polypurine tract and central tennination 
sequence (cPPT/CTS) has been suggested to improve the efficiency of gene deUvery by fflV-l 
10 based vectors to non-dividing cells (Zennou et d.. Cell. 2000 Apr 14;101(2):173-85, FoUenzi 
et al, Nat Genet. 2000 Jun;25(2):2 17-22). The analogous cis-acting element of EIAV is 
located in the polymerase coding region and can be obtained as a functional element by using 
PGR amplification from any plasmid which contains the EIAV polymerase coding region (for 
example pONY3.1, WO 99/32646) as foUows. The PGR product includes the central 
15 polypurine tract and the central tennination sequence (CTS). The oligonucleotide primers used 
in the PGR reaction were: 

EIAV cPPTPOS: CAGGTrATTCTAGAGTCGACGCTCTCATTACrTGTAAC 
EIAV cPPTNEG: CGAATGGGTTCTAGAGTGGAGCATGTTCACGAGGGATnTG 

The recognition sequence for Xbal is shown in bold face and aUows insertion into the 
20 pONYSG backbone. Before insertion of the cPPT/CTS PGR product prepared as described 
above, pONYSG was modified to remove the central termination sequence (CTS) which was 
already present in tiie pONYSG vector. This was achieved by subcloning the SaO. to Seal 
fragment encompassing the CTS and RRE region from pONYS.OZ into pSP72, prepared for 
ligation by digestion with SaR and EcoRV. The CTS region was then excised by digestion 

25 with Kpnl and PpwNfl, the overhanging ends 'blunted' by T4 DNA polymerase treatment and 
then the ends reUgated. The modified EIAV vector fi:agment was tiien excised using Safl and 
Nhel and Ugated into pONYSG prepared for ligation by digestion with the same enzymes. 
This new EIAV vector was termed pONYSG del CTS. pONYSG del CTS has two Xbal sites 
which flank the CMV-GFP cassette and die PGR product representing the cPPT/CTS, after 

30 digestion widi ICbal can be ligated into either site after partial digestion. ligation into these 
sites results in plasmids with tiie cPPT/CTS element in either tiie positive or negative senses. 
Clones in which the cPPT/CTS was in the positive sense (fimcdonally active) at either the 5* or 
3'-position were teimed pONYSG 5T0S del CTS and pONYSG 3'POS del CTS, respectively. 
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Another vector, termed pONYSZ 5'POS del CTS was also made following a similar strategy to 
that used to make pONYSG 5'POS del CTS. Accordingly, the CTS sequence present in 
pONYS.OZ was removed in the same way to make pONY8Z del CTS and the cPPT/CTS 
sequence was introduced into the unique Xbal site just upstream of the CMV promoter in 
5 pONYSZdelCTS. 

The pSMART CMV-HIF vector plasmid was derived from pONYSG 5'POS del CTS by 
replacement of the coding region for eGFP with that of HIF-la. This was achieved by 
digestion of the latter with SacH and Notl, which flank the eGFP gene, and ligation to a SacR- 
Notl fragment obtained from plasmid AdApt HIF-la-ires-GFP. Construction of plasmid 

10 AdApt HIF-la-ires-GFP is as described in Example 2 above. 

An additional derivative of pONY8G 5'POS del CTS was also made in order to produce vector 
preparations which serve as 'negative controls' in transduction experiments. This vector 
termed, pSMART CMV-empty (Figure 10) was made by digestion of pONYSG 5'POS del 
CTS with BsniBl and NoO, which flank the eGFP gene, followed by leligation. On the basis of 

15 , sequence analysis of the transcript driven by the internal promoter, only a 3 amino acid peptide 
is expected to be produced in cells transduced with this vector. 

The EIAV vectors described above were produced by transient co-transfection of 293T human 
embryonic kidney cells with either vector plasmid, pONYS.l (which expresses the EIAV 
gag/pol protein) and an envelope expression plasmid, pRV67 (which encodes the vesicular 

20 stomatitis virus protein G, VSV-G) using the calcium phosphate precipitatibn method. 

Twenty four hours before transfection the 293T ceUs were seeded at 3.6 x 10^ cells per 10cm 
dish in 10ml of DMEM supplemented with glutamine, non-essential amino acids and 10% 
foetal calf serum. Transfections were carried out m the late afternoon and the cells were 
incubated overnight prior to replacement of the medium with 6ml of fresh media supplemented 

25 with sodium butyrate (5mM). After 7 hours the medium was coUected and 6ml of frssh 
unsupplemented media added to the cells. The coUected medium was cleared by low speed 
centrifugation and then filtered through 0.4micron filters. 

Vector particles were then concentrated by low speed centrifugation (6,000g, JLA10.500 rotor) 
overnight at A°C and the supernatant poured off, leaving the pellet in the bottom of the tube. 
30 The following morning the remaining tissue culture fluid was harvested, cleared and filtered. 
It was then placed on top of die pellet previously collected and overnight centrifugation 
repeated. After this the supernatant was decanted and excess fluid was dramed. Then the 
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pellet was resuspended in formulation buffer to 1/1000 of the volume of starting supernatant. 
Aliquots were then stored at -80°C. 
Formulation bufBer (100ml) 
Tissue culture grade water 28.65ml 
5 19.75mMTris/Ha buffer pH 7.0 19.75ml of a O.IM solution 
40mg/ml lactose 26.6ml of a 150mg/ml solution 

37.5mM sodium chloride 24.4ml of a 154mM solution 

Img/ml human serum albumin" 500^1 of a 20% solution 

5n]/ml protamine sulphate*" lOOjil of a Smg^ml solution 

10 'Human serum albumin (20%) (Albutein. Alpha therapeutics UK Ud, Thetford, Norfolk). 

•"Protamine sulphate 5mg/ml (Prosulf, CP Pharmaceuticals, Wrexham, UK). 

The sequence of pSMART CMV-HIF is presented in SEQ ID NO:42 

The sequence of pSMART CMV-empty is presented in SEQ ID NO:43 

Example 3: Gene identrication in hippocampai neurones 
15 As discussed above in Example 1. hypoxia is an important component of stroke (cerebral 

ischaemia). The Smartomics method has now been utilised to improve the discovery of genes 

activated or repressed in response to hypoxia in primary rat hippocampai neurones. This 

involves augmenting the natural response to hypoxia, by experimentally introducing a key 

regulator of the hypoxia response, namely hypoxia inducible factor la (HIF-la). The 
20 overexpression of HIF-la in combination witii exposure of the cells to hypoxia has allowed 

the detection of gene expression changes which would not been detectable in response to 

overexpression of HIF-la alone, or hypoxia alone. 

Primary rat hippocampai neuron cultures were established according to standard procedures 
from embryonic rats (Dunnett SB, BjorUand A (Eds.) 1992. Neural Transplantation, A 

25 Practical Approach. IRL Press). Briefly, timed-pregnant Wistar rats at eighteen days of 
gestation were anaestiietised witii 0.7 ml isofluorane and killed by cervical dislocation. Pups 
were removed from Uie uteras and decapitated. Hippocampi wrae dissected and stored on ice 
in Hanks Buffered Saline Solution (HBSS) containing DNAse (0.05%) and glucose (2 mM) 
before incubation in trypsin (0.1%) plus DNAse (0.05%) for 5 minutes. After incubation, 

30 trypsin was inactivated by the addition of soybean trypsin inhibitor (SBTI. 0.1%) and ihe 
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solution genUy triturated. CeUs were pelleted by centrifugation (3000 rpm, 5 minutes) and the 
trypsin removed. Cells were then washed twice in HBSS containing SBTI and DNAse 
(0.05%). and iB-peUeted before final suspension in Dulbecco's Modified Eagle's Medium 
(DMEM) containing foetal calf serum (10%), glutamine (2 mM), and gentamicin (0.1 mg-ml" 

5 '). Cells (3 X 10* cells per dish) were plated out onto 60 mm dishes coated with poly-D-Lysine 
(50 ng.ml-*) and fibronectin adhesion promoting peptide (10 fig-ml*'). Cultures were placed 
into a humidified 3TC incubator containing 5% CO2 and twelve hours after plating, 50% of 
the plating medium was replaced with Neurobasal Media (Brewer GJ. 1995. Serum-free 
B27/neurobasal medium supports differentiated growth of neurons from the striatum, 

10 substantia- nigra, septum, cerebral cortex, cerebellum, and dentate gyrus. Journal of 
Neurosdence Research 42:674-83) supplemented with B27 and glutamine (2 mM). Cultures 
were fed every two days with supplemented neurobasal medium and were transduced on day 3 
in vitro. 

Transduction was carried out in supplemented neurobasal media containing polybrcne (2 
15 Mg.ml-'), in 0.5 volumes of the typical culture media volume. Five hours after the onset of 
transduction, the media volume was increased by a factor of 2, and was replaced 12 hours later. 
The viruses pSMART CMV-HIF (carrying the HEF-la gene; see Example 3), pSMART CMV- 
empty (an empty genome used as a control; see Example 3) and pONYSZ 5'POS del CIS 
(containing the P-galactosidase gene) were produced in parallel according to methods detailed 
20 above. The pONYSZ 5'POS del CTS was used to calculate viral titer in D17 cells and in 
hippocampal neurons. Comparison of the RNA packaging signal by quantitative RT-PCR 
(Taqman) of the three viral preps, allowed the biological titers of pSMART CMV-HIF and 
pSMART CMV-empty viruses to be estimated relative to that pONYSZ 5'POS del CTS. All 
transductions were done using approximately equal multiplicity of infections (MOIs) for both 
25 viruses, and the MOI used in each experiment was at least ten. 

Thirty-six hours after transduction, identical culture dishes were divided into two separate 
incubators, one at 37»C. 5% CO2, 95% air (=Normoxia) and the other at 37»C, 5% CO2. 94.9% 
Nitrogen, 0.1% Oxygen (=Hypoxia). After 6 hours culture under these conditions, the dishes 
were removed from the incubator, placed on a chilled platfonn, washed in cold PBS and total 
30 RNA was extracted using RNazol B (Tel-Test, Inc; distributed by Biogenesis Ltd) following 
the manufacturer's instructions. 

The experiment yielded four samples, differing only in their treatment with Antivirus and/or 
hypoxia, as shown below: 
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Sample 

1 
2 



3 

5 4 



Lenti virus 

pSMART CMV-empty 
pSMART CMV-empty 

pSMART CMV-HIF 
pSMART CMV-HE' 



71 

F.v pressed gene 

none 
none 

HIF-la 
HIF-la 



Oxygen condition 

Nonnoxia 
Hypoxia 

Normoxia 
Hypoxia 



Gene discovery can be implemented by comparing gene expression profiles between these 
samples. According to conventional methods pubUshed in the art, one would make 
comparisons between cell types 1 and 2. By implementing the Smartomics method, several 
10 other possibilities art seen. Firstly, a comparison can be made between cell types 1 and 3. 
Here, the stimulus of overexpiessing key molecules involved in the hypoxia response may 
exceed the natural response to hypoxia, as seen for cell type 2. Secondly, a comparison can be 
made between cell types 1 and 4. In this sitoation the natural response to hypoxia is being 
augmented or boosted by overexpressing key molecules involved in the hypoxia response. 
15 Global mRNA expression profiles from the RNA isolated from the four samples were obtained 
using the Research Genetics Rat GeneFilter GF300 (Research Genetics, HuntsviUe. AL). This 
method uses pre-made nylon arrays of DNA derived from I.M.A.G.EyLLNL cDNA clones 
containing the 3' ends of genes (http://image.llnl.gov/imageO. The arrays include more than 
5,000 genes covering a range of levels of characterisation, including sequences which are 
20 representative of unannotated ESTs or cDNA sequences of unknown fimction. 

RNA extracted from the 4 samples described above, was radioactively labelled and hybridised 
to separate copies of the Research Genetics Rat GeneHlter GF300. Methods provided by the 
manufacturer were followed (http://www jesgen.conj/products/GF200j)rotocol.php3) with the 
following modifications; RNAsin was added to the labelling reaction, and foUowing labelUng 
25 the mRNA/cDNA hybrid was denatured by incubation with 45mM EDTA/18mM NaOH at 
eS^C for 30 minutes. 

Images of hybridised arrays were obtained using a Molecular Dynamics Storm 
phosphorimager. RNA was then stripped from the arrays, following the aforementioned 
protocol. To ensure reproducibility, this procedure was repeated with the same RNA samples. 
30 Both data sets were then imported and analysed using Research Genetics Pathways 3.0 
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software, as explained in the Pathways 3.0 manual. Key aspects of the current analysis are 
suirunarised below: 
Project Tree set-up 

"Condition Pairs" mode was used to simultaneously analyse multiple experiments. In this 
5 context a condition is equivalent to a sample (e.g. Sample 3, overexpression of HIF-la in 
normoxia). 
Normalisation set-up 

Data point normalisation was selected, as explained in tiie Pathways 3.0 manual. This 
technique generates normalised intensities by dividing aU sampled intensities by the mean 
10 sampled intensity of all clones (except the control points) on the array. The two exp^ents 
were treated as separate normalisation groups, such that global differences in hybridisation 
signals between different arrays within the same experiment were corrected for. 

Comparison analysis 

Condition 1 (i.e. Sample 1) corresponds to cells transduced with the control lentivirus and 
15 placed under normal oxygen concentrations (normoxia). This was used as the reference 
condition in pairwise comparisons witii conditions 2, 3 and 4 (i.e. samples 2, 3 and 4). 
Comparisons were made in this way for all genes present on the Research Genetics GF300 
array. By comparing conditions the analysis considers data from both experiments. 

Results for four representative fouwn HIF-la/kypoxia-regulated genes 
20 As demonstration that overexpression of HIF-la in hypoxic cells is superior to using non- 
transduced hypoxic ceBs or overexpression of HBF-la in normoxic cells, in terms of 
discovering bona fide hypoxia-regulated genes, results are shown below for genes which are 
already known in the art to be regulated by hypoxia and HIF-la. Ratios are expressed as 
average ratios of normalised intensities. 
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Table 2. Respotise of known HIF-la/hypoxia-regulated genes 





PROTEIN 


NUCLEOTIDE 


RATIO SAK 


IPLE 1 (norm 


oxia) vs 


TITLE 


SEQ 
ID 


ACCESSION 


SEQ 
ID 


ACCESSION 


SAMPLE2 


SAMPLES 


SAMPI 
4 












(hypoxia) 


(Hif+normox 


(Hif+hyj 

xia) 


















Enolase I, alpha 




NP_036686 




NM.0I2554 


1.04 


0.86 


1.40 


Glucose-transporter protein 




AAA41248 




MI3979 


1.41 


0.78 


2.14 


Glyceraldehyde-3-phosphate 
dehydrogenase 




AAA40814 




M29341 


1.13 


1.42 


1.67 


Lactate dehydrogenase A 




CAA26000 1 


X01964 


1.36 


1.50 


1.77 



All four genes Usted in Table 2 are known in the art to be regulated by hypoxia, and 
5 specifically by Hif-la (Iyer et al (1998) Cellular and developmental control of O2 homeostasis 
by hypoxia-inducible factor la. Genes Dev 12:149-162). In the case of Enolase 1, alpha, the 
response to hypoxia or overexpression of Hif-la under normoxia is undetectable. It is only 
when Hif-la is overexpressed under hypoxia that an increase in expression level relative to 
normoxia is detected. In the case of glucose-transporter protein the detectable response to 
10 hypoxia is increased by the overexpression of Hif-la in hypoxia. In the case of both 
glyceraldehyde-3-phosphate dehydrogenase and Lactate dehydrogenase A the response to 
hypoxia is detectable, but it is increased by the overexpression of Hif-la under normoxia, and 
even more so by the overexpression of Hif-la under hypoxia. 

Filter settings 

15 Data filtering was then performed to reduce the data set and select genes with expression ratios 
of above 2.0 for at least one of the three pair-wise comparisons detailed above. Genes with low 
signal intensities in all four conditions were automatically eUminated, using an Intensity H 
filter minimum of 0.2. Genes which did not respond in a reproducible way in both experiments 
were automatically eliminated using the Students t-test filter (90% confidence level). 

20 Results were output as expression profiles of individual genes, showing normalised signal 
intensity and expression ratio. A key advantage of analysis in Pathways 3.0 is that high 
magnification thumbnaU images of individual spots from the original images are displayed. 
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This aUows visual verification that the area being measured truly covers the region containing 

the hybridised array spot 

Annotation of known and novel genes 

As demonstration that overexpression of HIF-la in hypoxic cells is superior to using non- 
5 transduced hypoxic cells or overexpression of HIF-la in normoxic ceUs, in terms of 
discovering novel hypoxia-regulated genes, results are shown below for a gene which is 
already known in the art to be regulated by hypoxia, but not by HIF-la, and for an 
unannotated gene. Ratios are expressed as average ratios of normalised intensities. 

Table 3. Response of novel HIF-la regulated genes 



PROTEIN 



NUCLEOTIDE 



RATIO SAMPLE 1 (normoxia) vs 



TITLE 



SEQID 



ACCESSION 



SEQID 



ACCESSION 



SAMPLE 2 



SAMPLE 3 



SAMPLE 4 



(hypoxia) 



(Hif+ 
nonnoxia) 



(Hif+ 
hypoxia) 



Metallothionein-I " 



37 



AAA41590 



38 



J0O75O 



1.61 



1.24 



3.49 



EST 



35 



none 



36 



AA901269 



1.43 



1.08 



3.47 



10 "representative metallothionein ESTs are spotted twice on the array, so the data is the average 
of two points 



Metallothionein-I is known in the literature to be regulated by hypoxia (Murphy et d (1999) 
Activation of metallothionein gene expression by hypoxia involves metal response elements 

15 and metal transcription factor-1. Cancer Res 59(6): 13 15-22). but it is not known to be 
regulated by HIF-la. The data in Table 3 show that the response to overexpression of HIF-la 
in hypoxia greatly exceeds that of hypoxia alone or the overexpression of HIF-la in nonnoxia. 
The EST (expressed sequence tag) is a completely unannotated DNA sequence. Similariy, the 
data in Table 3 show that the response to overexpression of HIF-la in hypoxia greatly exceeds 

20 that of hypoxia alone or flie overexpression of HIF-la in normoxia. 

This data demonstrates that the methods described above enable the further functional 
annotation of known genes and the functional annotation of completely unannotated novel 
genes with no knovra fdncdon. 
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Pvgm plft 4; Discussiop nf relevance of individual clones 

The nucleotide sequence splice variant represented of MEGS, otherwise known as Gtl2. is 
located within an imprinting cluster on human Chromosome 14q32 (Wylie et d 2000, Genome 
Research vol 10 ppl711-8). The MEG3 gene contains no substantial open reading frame and 

5 apparently does not encode a protein. The nucleotide sequence of MEG3 is represented in the 
public sequence databases by the accession AB032607 and is described herein as SEQ ID No. 
2. The Est R11658 represents a novel exon of the MEG3 gene of which tiiere are many within 
the imprinting cluster. This maternally expressed imprinted gene may function as an RNA 
molecule (Schuster-Gossler et al 1998, Developmental Dynamics vol 212 pp214-28). It has 

10 been shown recently that such RNA molecules may control diicctiy the expression of 
imprinted genes (Sleutels 2002, Nature vol 415 pp810-813). Our discovery that this gene is 
inducible by the hypoxia-related transcription factor EPASl raises the likeUhood of its 
involvement in cell survival under stress. 

The protein sequence encoded by hypothetical protein KIAA1696 is represented in the pubUc 
15 databases by tiie accession XP_051010 and is described herein as SEQ ID No. 3. The 
nucleotide sequence is represented in the public sequence databases by tiie accession 
XM_051010 and is described herein as SEQ ID No. 4. KIAA1696 contains a PHD Zmc finger 
motif that is commonly found in nuclear proteins thought to be involved in chromatin- 
mediated transcriptional regulation. Thus our observation of EPASl -inducibility enables the 
20 definition of a role for tiiis protein in tiie regulation of genes necessary for ceD survival under 
conditions of oxygen deprivation. 

The FOtein sequence encoded by EGLN3 is rei»resented in die public databases by tiie 
accession NP_a71356 and is described herein as SEQ ID No. 5. The nucleotide sequence is 
represented in tiie public sequence databases by tiie accession NM_022073 and is described 

25 herein as SEQ ID No. 6. EGLN3 is a member of tiie hif prolyl hydroxylase family of enzymes, 
which are involved in tiie degradation of transcription factors such as Hifl alpha (Bruick et ed 
2001, Science vol 294 ppl337-40). The prolyl hydroxylation activity of EGLN3 can be 
inhibited by tiie use of oxogluterate analogs. The residues His 135 and Asp 137 are key 
residues in die active site of die enzyme and tiierefore mutation of eitiier of tiiese residues will 

30 lead to a loss of function. Our observation of die strong inducibility of tiiis gene by EPASl 
enables tiie definition of its ix)le in die hypoxic response, providing tiie means for rapid 
adaptation of cells to normoxic conditions. 
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The protein sequence encoded by the spUce variant of PHPSl is represented in the public 
databases by the accession XP_096898 and is described herein as the SEQ ID No. 7. The 
nucleotide sequence is represented in the pubUc sequence databases by the accession 
BE671816 and is described herein as SEQ ID No. 8. The protein encoded by the spUce variant 
5 of PHPSl contains a transmembrane domain and has an unknown function. Our observation of 
its Hifla-inducibility enables the definition of its role in cell survival under conditions of 
oxygen deprivation. Its transmembrane domain increases the likelihood of effective therapeutic 
intervention. 

SEQ ID No. 10 corresponds to a singleton EST that is represented in die public databases by 
10 the accession N68173. Our observation of its EPASl-inducibility enables the definition of its 
role in cell survival under conditions of oxygen deprivation. 

The protein sequence encoded by IL6R is represented in the public databases by the accession 
NP_000556 and is described herein as SEQ ID No. 11. The nucleotide sequence is represented 
in the public sequence databases by the accession NM_000565 and is described herein as SEQ 

15 ID No. 12. Elevated levels of IL6 and the IL6 receptor (IL6R) have recenfly been shown to be 
associated with cardiac disease (Plenz et al 2001. Eur J Heart Failure vol 3 pp415.21). Also, 
the ablation of IL6R activity is known to be effective in the treatment of inflammatory 
conditions (Nishimoto et al 2000, Annals of the Rheumatic Diseases 2000 vol 59 Suppl 1 
pp21-7). Our observation of the EPASl-inducibility of IL6R enables the prediction of its 

20 hypoxia-inducibility. The artificial control of the hypoxic response could reduce tiie level of 
IL6R in relevant tissues and tiius ameUorate the effects of IL6R on disease processes. 
The protein sequence encoded by hypotiietical protein XP_091988 is represented in the public 
databases by the accession XP_091988 and is described herein as SEQ ID No. 13. The 
nucleotide sequence is represented in the public sequence databases by the accession 

25 XM_091988 and is described herein as SEQ ID No. 14. Hypothetical protem XP_091988 
represents a Zinc finger protein likely to be involved in protein-protein interactions via its 
KRAB domain. Thus our observation of EPASl-inducibiUty enables the definition of a role for 
this protein in the regulation of genes necessary for ceU survival under conditions of oxygen 
deprivation. 

30 SEQ ID No. 18 represents a singleton EST that is present in the pubUc sequence databases with 
the accession number R95132. Our observation of its EPASl-inducibility enables the 
definition of its role in cell survival undw conditions of oxygen deprivation. 
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The protein sequence encoded by FRA2, also known as Fos-like antigen 2. is represented in 
the pubUc databases by the accession P15408 and is described herein as SEQ ID No. 19. The 
nucleotide sequence is represented in the public sequence databases by the accession 
BC022791 and is described herein as SEQ ID No. 20. FRA2 is a DNA binding Leucine zipper 
5 and is a member of the FOS superfemily. Members of the FOS superfemily have been 
impUcated as regulators of cell proUferation, differentiation and transformation and arc known 
to dimerize with members of the Jun family of Leucine zippers to form the AP-1 complex. 
ERA2, along with other AP-1 proteins, shows a correlation of elevated levels with clinico- 
pathologic tumour parameters. It is impUcated in the regulation of cell cycle progression 
10 (Bamberger et al 2001, J Cancer Res Clin Oncol vol 127 pp545-50) and has been shown to be 
up-regulated in immortalized cells (Sheerin et al. 2002 Cancer Lett Mar 8; 177(l):83-7). The 
effects of FRA2 on differentiation are well known; e.g. c-Fos and Fra2 co-operate with c-Jun 
to prevent terminal myoblast differentiation and withdrawal from the cell cycle (Dauiy et al 
2001, Oncogene vol 20 pp 7998-8008). FRA2 is also required for cytokine gene induction and 
15 is involved in the regulation of TNF production (Udalova et al. 2001 Biochem Biophys Res 
Commun, vol 289 pp25-33). Levels of FRA2 have been shown to decrease with age 
(Medicherla et al. 2001, Mech Ageing Dev, vol 122 ppl69-86). FRA2 is highly up-regulated 
in epitiielial cells under hypoxic stress. FRA2 also shows marked elevation in repeatedly 
stressed rates. FRA2 up-regulates the stress-related enzymes tyrosine hydroxylase and 
20 dopamine beta-hydroxylase (Nakova et al 2000, J Neuroscience vol 20 pp5647-53). FRA2 has 
also been implicated with a role in tissue remodelling. TIMP is up-regulated in response to cell 
damage. FRA2 plays a role in that up-regulation in injured liver hepatic stellate cells (Bahr et 
al 1999, Hepatology vol 29 pp839-48). SPARC is an extracellular matrix protein, which is 
thought to contribute to the remodelling of the extracellular matrix during neoplastic 
25 progression. Jun-mediated repression of SPARC is enhanced by FRA2 (Vial et al 2000. 
Oncogene vol 19 pp5020-9). Our observation of its EPASl-indudbility enables the definition 
of its role under conditions of oxygen deprivation, in the regulation of genes necessary for cell 
survival. These conditions apply within solid tumours, and in tumour cells cell cycle 
progression is a crucial pathologic feature. Tissue re-modeUing is potentially a damaging 
30 process in diseases such as rheumatoid arthritis. Thus we have shown that interfering with the 
hypoxia-induction of FRA2 could have a beneficial therapeutic effect. 
The full-length protein sequence encoded by DBL also known as Membrane-assodated 
diazepam binding inhibitor, is not represented in the public database. A fragment of the protein 
is represented by accession number XP_038526 and is described here by the SEQ ID No. 21. 
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The full-length nucleotide sequence of DBI is also absent from the public databases but is 
described here by SEQ ID No. 22. DBI binds to an allosteric regulatory site and reduces the 
activity of the GABA(A) receptor complex. (Gray et al 1986, PNAS vol 83 pp7547-51. 
Bormann 1991, Neuropharmacology vol 30 ppl387-9.) GABA(A) activity regulates 
5 steroidogenesis, and thus has a role in reproductive hormone secretion (Papadopoulos et al 
1991, Neuropharmacology vol 30 ppl417-23), and also in the regulation of die behavioural 
responses to stress via corticotrophin releasing factor (CRF) (Guidotti 1991, 
Neuiophannacology vol 30 npp 1425-33). Very recently it has been shown that CRF has utility 
as an anti-inflammatory agent (McLoon et al 2002, Inflainm Res vol 51 ppl6-23). Our 
10 observation of the Hifl a and EPASl regulation of DBI indicates the utility of intervention via 
DBI to influence hypoxia-induced apsects of the stress response and inflammation. DBI binds 
acyl-CoA esters, and affinity increases with acyl chain lengtii (Knusden 1991, 
Neuropharmacology vol 30 ppl405-10). In October 2001 a paper was published linking co- 
incident expression of DBI and the fatty acid binding protein (FAPB). We have shown 
15 previously that expression of FAPB is induced by hypoxia (see UK appUcation 0030076.4). 
The tissue distribution of DBI encompasses regions outside the brain, specifically squamous 
epitiielia in the digestive system (Yanase et al 2001, Arch Histol Cytol vol 64 pp449-60). The 
coordinate hypoxia induction of DBI and FAPB implies a joint role in cell survival under 
conditions of oxygen deprivation. 
20 The protein sequence encoding BACH2 is represented in tiie public databases by the accession 
CAC28130 and is described herein as SEQ ID No. 23. The nucleotide sequence is represented 
in tiie public databases by tiie accession AJ271878 and is described herein as SEQ ID No. 24. 
Bach2 is a transcriptional regulator containing a basic leucine zipper domwn. Its synthesis is 
transcriptionally repressed by constitutive expression of tiie oncogene BCRIASL, a tyrosine 
25 kinase. It has been shown recently that BACH2 functions as a tumour suppressor by repressing 
transcription of genes diat.lead to progression of some leukaemias (Vieira et al 2001, Genes 
Chromosomes Cancer vol 32 pp353-63). The antiproliferative effect of BACH2 could be 
harnessed in the treatinent of malignant disease. Our observation tiiat tiiis gene is induced by 
EPASl makes clear fliat artificial control of tiie hypoxic response could have an 
30 antiproliferative effect in some cell types as a result of increased B ACH2 activity. 

SEQ ID No. 26 represents a singleton EST tiiat is present in tiie public sequence databases witii 
tiie accession number T8601. Our observation of its EPASl-inducibility enables die definition 
of its role in cell survival under conditions of oxygen deprivation. 
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The protein sequence encoding Aldehyde dehydrogenase 12 is represented in the public 
sequence database by the accession NP_072090 and is described herein as SEQ E) No. 27. The 
nucleotide sequence is represented in the pubUc databases by the accession im.022568 and is 
described herein as SEQ ID No. 28. Our observation of its EPASl-inducibility enables the 

5 definition of its role in cell survival under conditions of oxygen deprivation. 

SEQ ID No. 30 represents a singleton EST that is present in the pubUc sequence databases with 
the accession number R06745. Our observation of its Hifla and EPASl-inducibility enables 
the definition of its role in cell survival under conditions of oxygen deprivation. 
SEQ ID No. 32 represents a singleton EST that is present in the public sequence databases with 

10 the accession number N64734. Our observation of its strong EPASl-inducibility enables the 
definition of its role in ceU survival under conditions of oxygen deprivation. 
SEQ ID No. 34 represents a singleton EST that is present in the public sequence databases with 
the accession number T85201. Our observation of its strong EPASl-inducibility enables the 
definition of its role in cell survival under conditions of oxygen deprivation. 

15 The protein sequence encoded by the rat metallothionein la gene is represented in the public 
databases by the accession AAA41590 and is described herein as SEQ ID No. 37. The 
nucleotide sequence is represented in the public sequence databases by the accession J00750 
and is described herein as SEQ ID No. 38. The metallothionein genes are known to play a role 
in scavenging free radicals and therefore protecting against DNA damage (Irato et al 2001, 

20 Immunol Cell Biol vol 79 pp251-4). Our observation of its Hifla -inducibility enables the 
definition of its role in cell survival under conditions of oxygen deprivation. 
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CLAIMS 

1 . A substantially purified polypeptide, which polypeptide: 

i) comprises or consists of the anuno acid sequence as recited in any one of 
SEQ ro Nos: 3, 5, 7, 11, 13. 19, 21, 23 or 27; 
5 ii) comprises or consists of an amino acid sequence encoded by a nucleic acid 

sequence recited in any one of SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14, 18, 20, 22, 
24, 26, 28, 30, 32, 34 or 36, or encoded by a gene identified firom an EST 
recited in any one of these SEQ JD Nos; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said 
10 fragment retains a biological activity possessed by the full length polypeptide 

of i) or ii), or has an antigenic determinant in common with the polypeptide of 
i) or ii); or 

iv) is a functional equivalent of a polypeptide of i),ii) or (iii). 

2. A polypeptide according to claim 1, wherein said biological activity is a hypoxia-regulated 
15 activity. 

- 3. A polypeptide according to claim 2, wherein the expression of the polypeptide is hypoxia- 
induced. 

4. A polypeptide which is a functional equivalent according to part iv) of any one of claims 1- 
3, is homologous to the amino acid sequence as recited in any one of SEQ ID Nos: 3, 5, 7, 

20 11, 13, 19, 21, 23 or 27 or is homologous to the amino acid sequence encoded by a nucleic 
acid as recited m any one of SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14, 18, 20, 22, 24, 26, 28, 30, 
32, 34 or 36, and has equivalent biological activity to that possessed by the full length 
polypeptide of i) or ii). 

5. A fragment or functional equivalent according to any one of claims 1-4, which has greater 
25 than 50% sequence identity with the amino acid sequence as recited in any one of SEQ ID 

Nos: 3, 5, 7, 11, 13, 19, 21, 23 or 27 or with the amino acid sequence ttiat is encoded by a 
nucleic acid as recited in any one of SEQ ID Nos: 2, 4, 6, 8, 10. 12, 14, 18, 20, 22, 24, 26, 
28, 30, 32, 34 or 36, or with fragments thCTeof, preferably greater flian 60%, 70%, 80%, 
90%, 95%, 98% or 99% sequence identity. 

30 6. A fragment as recited in any one of claims 1-5, having an antigenic determinant in 
common with a polypeptide according to part i) of any one of claims 1-5, which consists of 
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7 or more (for example. 8. 10, 12, 14. 16. 18, 20 or more) amino acid residues from the 
amino acid sequence as it:cited in any one of SEQ ID Nos: 3. 5, 7. 11. 13, 19, 21. 23 or 27 
or the amino acid sequence encoded by a nucleic acid as recited in any one of SEQ ID Nos: 
2. 4. 6. 8. 10, 12, 14. 18. 20. 22. 24, 26, 28. 30, 32, 34 or 36. 
5 7. A purified and isolated nucleic acid molecule that encodes a polypeptide according to any 

one of claims 1-6. 

8. A purified nucleic acid molecule according to claim 7, which consists of the nucleic acid 
sequence as recited in any one of SEQ ID Nos.: 2, 4, 6, 8, 10, 12, 14, 18. 20. 22. 24, 26, 28, 
30, 32, 34 or 36. or is a redundant equivalent or fragment ttiereof. 
10 9. A purified nucleic acid molecule which hydridizes under high stringency conditions with a 
nucleic acid molecule according to claim 7 or claim 8. 

10. A vector comprising a nucleic acid molecule as recited in any one of claims 7-9. 

11. A delivery vehicle comprising a nucleic acid according to any one of claims 7-9. or a 
vector according to claim 10. 

15 12. A host cell transformed with a vector according to claim 11. 

13. An antagonist ligand which binds specifically to. and which inhibits the hypoxia-induced 
activity of, a polypeptide according to any one of claims 1-6. 

14. An agonist Ugand which binds specifically to, and which activates the hypoxia-induced 
activity of, a polypeptide according to any one of claims 1-6 to augment or potentiate a 

20 hypoxia-induced activity. 

15. A ligand according to claim 13 or 14. which is an antibody. 

16. A ligand according to claim 13 or 14, which U a peptide, a peptidomimetic. or a drug 
molecule, such as a small natural or synthetic organic molecule of up to 2000Da, 
preferably SOODa or less. 

25 17. A pharmaceutical composition suitable for modulating hypoxia and/or ischaemia. 
comprising a therapeutically-effective amount of a polypeptide according to any one of 
claims 1-6, a nucleic acid molecule according to any one of claims 7-9. a vector according 
to claim 10 or a ligand according to any one of claims 13-16, in conjunction with a 
phannaceuticaDy-acceptable carrier. 
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18. A vaccine composition comprising a polypeptide according to any one of claims 1-6, a 
nucleic acid molecule according to any one of claims 7-9, a vector according to claim 10 or 
a ligand acccffding to any one of claims 13-16. 

19. A polypeptide according to any one of claims 1-6, a nucleic acid molecule according to 
5 any one of claims 7-9, a vector according to claim 10 or a ligand according to any one of 

claims 13-16, for use in dierapy or diagnosis of disease. 

20. A polypeptide, nucleic acid molecule, vector or ligand as recited in claim 19, wherein said 
disease is a hypoxia-regulated condition. 

21. A polypeptide, nucleic acid molecule, vector or ligand as recited in claim 19, wherein said 
10 hypoxia-regulated condition is tumourigenesis, angiogenesis. apoptosis, inflammation, 

eiythropoiesis, the biological response to hypoxia conditions (including processes such as 
glycolysis, gluconeogenesis. glucose transportation, catecholamine synthesis, iron 
transport or nitric oxide synthesis). 

22. A method of treating a disease in a patient in need of such treatment by administmng to a 
15 patient a therapeutically effective amount of a polypeptide according to any one of claims 

1-6, a nucleic acid molecule according to any one of claims 7-9. a vector according to 
claim 10, a ligand according to any one of claims 13-16 or a pharmaceutical composition 
according to clsum 17. 

23. A method of regulating tumourigenesis, angiogenesis, apoptosis, the biological response to 
20 hypoxia conditions, or a hypoxic-associated pathology in a patient in need of such 

treatment by administering to a patient a therapeutically effective amount of a polypeptide 
according to any one of claims 1-6, a nucleic acid molecule according to any one of claims 
7-9, a vector according to claim 10, a ligand according to any one of claims 13-16 or a 
pharmaceutical composition according to claim 17. 
25 24. A method according to claim 23, wherein, for diseases in which the expression of the 
natural gene or the activity of the polypeptide is lower in a diseased patient when 
compared to the level of expression or activity in a healthy patient, the polypeptide, nucleic 
acid molecule, ligand. compound or composition administered to the patient is an agonist 
25. A method according to claim 23, wherein, for diseases in which the expression of the 
30 natural gene or activity of the polypeptide is higher in a diseased patient when compared to 
the level of expression or activity in a healthy patient, the polypeptide, nucleic acid 
molecule, vector, ligand. compound or composition administered to the patient is an 
antagonist 
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26. A polypeptide according to any one of claims 1-6, a nucleic acid molecule according to 
any one of claims 7-9, a vector according to claim 10, a ligand according to any one of 
claims 13-16 or a pharmaceutical composition according to claim 17, for use in the 
manufacture of a medicament for the treatment of a hypoxia-regulated condition. 

5 27. A metiiod of monitoring the therapeutic treatment of a disease or physiological condition 
in a patient, comprising monitoring over a period of time the level of expression or activity 
of a polypeptide according to any one of claims 1-6, a nucleic acid molecule according to 
any one of claims 7-9, a vector according to claim 10, a Ugand according to any one of 
claims 13-16 or a phannaceutical composition according to claim 17, in tissue from said 

10 patient, wherein altering said level of expression or activity over die period of time towards 
a control level is indicative of regression of said disease. 

28. A metiiod of providing a hypoxia regulating gene, an apoptotic or an angiogenesis 
regulating gene by administering diiecfly to a patient in need of such therapy an 
expressible vector comprising expression control sequences operably linked to one or more 

15 of tiie nucleic acid molecules recited in claims 7-9. 

29. A metiiod of diagnosing a hypoxia-regulated condition in a patient, comprising assessing 
die level of expression of a natural gene encoding a polypeptide according to any one of 
claims 1-6, or assessing flie activity of such a polypeptide, in tissue from said patient and 
comparing said level of expression or activity to a control level, wherein a level tiiat is 

20 different to said control level is indicative of die hypoxia-rclated condition. 

30. A metiiod according to claim 29, tiiat is carried out in vitro. 

31. A method according to claim 29 or claim 30, which comprises tiie steps of: (a) contacting a 
ligand according to any one of claims 13-16 with a biological sample under conditions 
suitable for die formation of a ligand-polypeptide complex; and (b) detecting said complex. 

25 32. A metiiod according to claim 29 or claim 30, comprising die steps of: 

a) contacting a sample of tissue from the patient witii a nucleic acid probe under 
stringent conditions tiiat allow the formation of a hybrid complex between a 
nucleic acid molecule according to any one of claims 7-9 and the probe; 

b) contacting a control sample witii said probe under tiie same conditions used in 
30 step a); and 

c) detecting the presence of hybrid complexes in said samples; 
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wherein detection of levels of the hybrid complex in flie patient sample that differ from 
levels of the hybrid complex in the control sample is indicative of the hypoxia-related 
condition. 

33. A method accordmg to claim 29 or claim 30, comprising the stq)s of: 

5 a) contacting a sample of nucleic acid from tissue of the patient with a nucleic acid 

primer under stringent conditions that allow the formation of a hybrid complex 
between a nucleic acid molecule according to any one of claims 7-9 and the 
primer; 

b) contacdng a control sample with said primer under the same conditions used in 
10 step a); 

c) amplifying the sampled nucleic acid; and 

d) detecting the level of amplified nucleic acid from both patient and control 
samples; 

wherein detection of levels of the amplified nucleic acid in the patient sample that differ 
15 significantiy from levels of the amplified nucleic acid in the control sample is indicative 

of the hypoxia-related condition. 

34. A method according to claim 29 or claim 30, comprising the steps of: 

a) obtaining a tissue sample from a patient being tested for the hypoxia-related 
condition; 

20 b) isolating a nucleic acid molecule accordmg to any one of claims 7-9 from said 

tissue sample; and 

c) diagnosing the patient for disease by detecting the presence of a mutation 
which is associated with the hypoxia-related condition in the nucleic acid 
molecule as an indication of the hypoxia-related condition. 

25 35. The method of claim 34. further comprising amplifying the nucleic acid molecule to form 
an amplified product and detecting the presence or absence of a mutation in the amplified 
product. 

36. A method according to any one of claims 29-35, wherein said disease is cancer, ischaemic 
conditions, reperfiision injury, retinopathy, neonatal stress, preeclapmsia. atherosclerosis, 
30 inflammatory conditions, wound healing, tumourigenesis, angiogenesis, apoptosis, 
inflanmiation or erythropoiesis. 
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37. A method according to claim 36, wherein said hypoxia or ischaemia-related tissue damage 
is due to a disorder of the cerebral, coronary or peripheral circulation. 

38, A method according to any one of claims 29, 30 and 32-36, wherein the tissue is a cancer 
tissue. 

5 39. A method for the identification of a compound that is effective in the treatment and/or 
diagnosis of disease, comprising contacting a polypeptide according to any one of claims 
1-6, or a nucleic acid molecule accordmg to any one of claims 7-9 with one or more 
compounds suspected of possessing binding affinity for said polypeptide or nucleic acid 
molecule, and selecting a compound that binds specifically to said nucleic acid molecule or 

10 polypeptide. 

40. A compound identified by a method according to claim 39. 

41. A compound according to claim 40, which is a natural or modified substrate, an enzyme, a 
receptor, a small organic molecule, such as a small natural or synthetic organic molecule of 
up to 2000Da, preferably 800Da or less, a peptidomimetic, an inorganic molecule, a 

15 peptide, a polypeptide, an antibody, or a structural or functional mimetics of any of these 
compounds. 

42. A kit useful for diagnosing disease comprising a first container containing a nucleic acid 
probe that hybridises under stringent conditions with a nucleic acid molecule according to 
any one of claims 7-9; a second container containing primers useful for ampliiying said 

20 nucleic acid molecule; and instructions for using die probe and primers for facilitating the 
diagnosis of disease. 

43. The kit of claim 42, further comprising a third container holding an agent for digesting 
unhybridised RNA. 

44. An array of at least two nucleic acid molecules, wherein each of said nucleic acid 
25 molecules either corresponds to the sequence of, is complementary to die sequence of, or 

hybridises specifically to a nucleic acid molecule according to any one of claims 7-9. 

45. An airay according to claim 44, which contains nucleic acid molecules that either 
correspond to the sequence of, are complementary to tiie sequence of, or hybridise 
specifically to at le&t 2,^3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16. 17 or 18 of tiie nucleic 

30 acid molecules implicated in a hypoxia-regulated condition as recited in claims 7-9. 

46. An array according to any claim 44 or claim 45, wherein said nucleic acid molecules 
consist of between twelve and two thousand nucleotides. 



PCT/CB02/00817 

WO 02/068466 i-ci/odu* 

88 

47. An anay of antibodies, comprising at least two different antibody species, wherein each 
antibody species is immunospecific with a polypeptide implicated in a hypoxia-regulated 
condition as recited in any one of claims 1-6. 

48. An array of polypeptides, comprising at least two polypeptide species as recited in any one 
5 of claims 1-6, wherein each polypeptide species is implicated in a hypoxia-regulated 

condition, or is a ftmctional equivalent variant or fragment thereof. 

49. A kit comprising an array of nucleic acid molecules according to any one of claims 7-9. 

50. A kit comprising one or more antibodies that bind to a polypeptide as recited in any one of 
claims 1-6; and a reagent useful for the detection of a binding reaction between said 

1 0 antibody and said polypq>tide. 

51. A transgenic or knockout non-human animal that has been transformed to express higher, 
lower or absent levels of a polypeptide according to any one of claims 1-6. 

52. A method for screening for a compound effective to treat disease, by contacting a non- 
human transgenic animal according to claim 51 with a candidate compound and 

15 determining the effect of the compound on the disease of the animal. 
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FIG. 3A 



Experiment #1 

Array Location: l.c.13.1 
Clone: 43550 
Gene: LDH-A 



Array Location: 1.C.13.2 
Clone: 43550 
Gene: LDH-A 
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Experiment #2 



Array Location: 1.C.13.1 
Clone: 43550 
Gene: LDH-A 



Array Location: 1.C.13.2 
Clone: 43550 
Gene: LDH-A 
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FIG. 3B 



Average Relative 
mRNA level 



Oxygen Level: 



O 
CM 

D. 
O) 



o 



CM 



Q. 



2 CO 



o 

CM 



o. 
Ill 























-J- 1, - 



CL 
HI 



FIG. 4B 6 



4 

Average Relative 
mRNA level 3 



Oxygen Level: 



it 



75 



IF 



O 
CM 



O 

o. 



O T- 

CM d 



wo 02/068466 



5/13 



PCT/CB02/00817 



Experiment #1 

Array Location: 2,f,22,1 
Clone: 50117 
Gene: GAPDH 



Array Location: 1 ,f,22,1 
Clone: 50117 
Gene: GAPDH 



FIG. 4A 
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Gene: GAPDH 
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FIG. 5A 



Experiment #1 



Array Location: 2,8,21,12 
Clone: 343320 
Gene: PDGF Beta 
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Array Location: 1 ,g,7,8 
Clone: 67654 
Gene: PDGF Beta 
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Experiment #2 



Array Location: 2e,21,12 
Glone:343320 
Gene: PDGF Beta 



Array Location: 1 ,g,7,8 
Clone: 67654 
Gene: PDGF Beta 
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FIG. 5B 
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FIG. 6A 



Experiment #1 

Array Location: 1 ,a,22,2 
Clone: 768561 
Gene: MCP-1 



Experiment #2 

Array Location: 1,a,22,2 
Clone: 768561 
Gene: l\4CP-1 
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FIG. 7A 



Experiment #1 

Array Location: 2,a,30,5 
Clone: 293336 
Gene: (only ESTs) 



Experiment #2 

Array Location: 2,a,30,5 
Clone: 293336 
Gene: (only ESTs) 
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FIG. ^ 
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FIG. 10 
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SEQUENCE LISTING 

c^aJ^Lgagagcagagagggagcgcgccttcgctcgctcsgccttggcggc^ 

ScTGGGGCGCCCACGAGAGGATCCCTCACCCGGGTCTCTCCTCAGGGATGACATCATC^^^ 

CTCCTTGTCTTCAAGGACCACCTCCTCTCCATGCTGAGCTGCTGCCAAGGGGCCTCCTGCCCATCT 

ACACCTCACGAGGGCACTAGGAGCACGGTTTCCTGGATCCCACCAACATACAAAGCAGCCACTCAC 

TGACCCCCAGGACCAGGATGGCAAAGGATGAAGAGGACCGGAACTGACCAGCCAGCTGTCCCTCTT 

ACCTAAAGACTTAAACCAATGCCCTAGTGAGGGGGCATTGGGCATTAAGCCCTGACCTTTGCTATG 

CTCATACTTTGACTCTATGAGTACTTTCCTATAAGTCTTTGCTTGTGTTCACCTGCTAGCAAACTG 

GAGTGTTTCCCTCCCCAAGGGGGTGTCAGTCTTTGTCGACTGACTCTGTCATCACCCTTATGATC^ 

CCTGAATGGAAGGATCCCTTTGGAAATTCTCAGGAGGGGGACCTGGGCCAAGGGCTTGGCCAGCAT 

CCTGCTGCAACTCCAAGGCCCTGGGTGGGCTTCTGGAATGAGCATGCTACTGAATCACC^ 

CGCCCCACCTCTCTGAAGATCTTCCTATCCTTTTCTGGGGGAATGGGGTCGATGAGAGCAACCTCC 

TAGGGTTGTTGTGAGAATTAAATGAGATAAAAGAGGCCTCAGGCAGGATCTGGCATAGAGGAGGTG 

ATCAGCAAATGTTTGTTCAAAAGGTTTGACy^GGTCAGTCCCTTCCCACCCCTCTT^CTTGTCTTAC 

TTGTCTTATTTATTCTCCAACAGCACTCCAGGCAGCCCTTGTCCACGGGCTCTCCTTGCATCAGCC 

AAGCTTCTTGAAAGGCCTGTCTACACTTGCTGTCTTCCTTCCTCACCTCCAAlTrCCT^ 

CACTGCTTCCTGACTCGCTCTACTCCGTGGAAGCACGCTCACAAAGGCACGTGGGCGTGGCCCGGC 

TGGGTCGGCTGAAGAACTGCGGATGGAAGCTGCGGAAGAGCCCTGATGGGGCCCACCATCCCGGAC 

CCAAGTCTTCTTCCTGGCGGGCCTCTC6TCTCCTTCCTGGTTTGGGCGGAAGCCATCACCTGGATG 

CCTACGTGGGAAGGGACCTCGAATGTGGGACCCCAGCCCCTCTCCAGCTCGAAATCCCTCCACAGC 

CACGGGGACACCCTGCACCTATTCCCACGGGACAGGCTGGACCCAGAGACTCTGGACCCGGGGCCT 

CCCCTTGAGTAGAGACCCGCCCTCTGACTGATGGACGCCGTGACCTGGGGTCAGACCCGTGGGCTG 

GACCCCTGCCCy^CCCCGCJ^GGAACCCTGAGGCCTAGGGGAGCTGTTGAGCCTTCAGTGTCTGCATG 

TGGGAAGTGGGCTCCTTCACCTACCTCACAGGGCTGTTGTGAGGGGCGCTGTGATGCGGTTCCAAA 

GCACAGGGCTTGGCGCACCCCACTGTGCTCTCAATAAATGTGTTTCCTGTCTTAACAAAAAAAAAA 

AAAAAAAAAA 
Seg ID 3 

MELQTLQEALKVEIQVHQKLVAQMKQDPQKADLKKQUiELQAKITALSEKQKRVVEQIJUa^ 

EQPDKFQIQPLPQSENKLQTAQQQPLQQLQQQQQYHHHHAQQSAAASPNLTASQKTVTTASMITTK 

TLPLVUCAATATMPASWGQRPTIAMVTAINSQKAVLSTDVQNTPVNLQTSSKVTGPGAEAVQIV^ 

KNTVTLQVQATPPQPIKVPQFIPPPRLTPRPNFLPQVRPKPVAQNNIPIAPAPPPMLAAPQLIQRP 

VMLTKFTPTTLPTSQNSIHPVRVVNGQTATIAKTFPMAQLTSIVIATPGTRIAGPQTVQLSKPSLE 

KQTVKSHTETDEKQTESRTITPPAAPKPKREENPQKLAFMVSLGLVTHimEEIQSKRQERK^ 

ANPVYSGA\7FEPERKKSAVTYIiNSTMHPGTRKRGRPPKVNAVIiGFGALTPTSPQSSHPDSPENEKT 

ETTFTFPAPVQPVSLPSPTSTDGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPI.KTIPKGMW 

ICPRCQDQMLKKEEAIPWPGTIiAIVHSYIAYKAAKEEEKQKLLKWSSDLKQEREQLEQKVKQLSNS 

ISKCMEMKlTOIIJ^QKEMHSSLEKVKQLIRLIHGIDLSKP\nDSEATVGAISNGPDCTPPANAATST 

PAPSPSSQSCTANCNQGEETK 
Seg ID 4 

GTGAGAGAGAGAGGAGAGGAGAGGAGGAGGAGGAGGAGGGAGAAGGGAACAACCTACCATCTTAAC 
ACACTAATATCTAAAAAGTGCGAGAGGCCCAGAGCAGCAGCAGAAGCAGCAGCAGCAGCTCCAGCT 
TCTTCCCTCCCTCCCCATGAAGAAGAGTTCCCTCCTCCTCCTCCTCCTGCTTCTCCTGCTCAGAGT 
TCCTGCCTCCAGCTGCCAGGGGGGACAGCCAGCCAGCAGCAGGAGGGGGGCTAGAGAGCTGAAGGA 
GAGCCAGTTTCCCCAAAATTGCTGCAGTGAGAAGAGaAGTTTGTTACTTTAAACAGAGGCT^ 
AACTATAGAATTAGCAGAGAAAGTCGAGAAGGTAGAGGATGGAGTTGCAGACTCTACAGGAGGCTC 
( OTAAAGTGGAAATTCAGGTTCACCAGAAACTGGTTGCTCAAATGAAGOVGGATCCACAGAATGCTG 
ACTTAAAGAAACAGCTTCATGAACrCCAAGCCAAAATCACAGCTTTGAGTGAGAAACAGAAAAGAG 
TAGTTGAACAGCTACGGAAGJ^CCTGATAGTAAAGCAAGAACAACCGGACAAGTTCCAAATACAGC 
CATTGCCACAATCTGAAAACAAACTACAAAC^GCACJ^GCAGCAACCACTACAGCAACT^ 
AGCAGCAGTACCACCa^CCACCACXSCay^GCAGTCAGCTGCAGCCTCTCCCAACCTGACTGCTTCAC 
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AGAACSACTGTAACTACAGCTTCTATGATTACCAC^GACACTACCTCTCGTCTTGAAAGCAGCAA 
CTGCGACCATGCCTGCCTCTGTGGTGGGCCAGAGACCTACCATTGCTATGGTCACCGCCATCAACA 
GTCAGAAGGCTGTGCTCAGCACTGATGTGCAGAACACACCAGTCAACCTCCAGACGTCTAGTAAGG 
TCACTGGGCCTGGGGCAGAGGCTGTCCAAATTGTGGCAAAAAACACAGTCACTCTGCAGGTTCAGG 
5 CAACACCTCCTCAGCCCATCAAAGTACCACAGTTTATCCCCCCTCCTAGACTCACTCCACGTCCAA 
ACTTTCTTCCACAGGTTCGACCCAAGCCTGTGGCCCAGAATAACATTCCTATTGCCCCAGCACCAC 
CTCCGATGCTCGCAGCTCCTCAGCTTATCCAGAGGCCCGTCATGCTGACCAAGTTCACCCCCACAA 
CCCTTCCCACATCCCAGAATTCCATCCACCCCGTCCGTGTCGTCAATGGGCAGACTGCAACCATAG 
CCAAAACGTTCCCCATGGCCCAGCTCACCAGCATTGTGATAGCTACTCCAGGGACCAGACTCGCTG 
10 GACCTCAAACTGTACAGCTTAGCAAGCCAAGTCTTGAAAAACAGACAGTTAAATCTCACACAGAAA 
CAGATGAGAAACAAACAGAGAGCCGCACCATCACCCCACCTGCTGCACCCAAACCAAAACGGGAGG 
AGAACCCTCAGAAACTTGCCTTCATGGTGTCTCTAGGGTTGGTAACACATGACCATCTAGAAGAAA 
TCCAAAGCAAGAGGCAAGAGCGAAAAAGAAGAACAACAGCAAATCCGGTCTACAGTGGAGCAGTCT 
TTGAGCCAGAGCGTAAGAAGAGTGCAGTGACATACCTAAACAGCACAATGCACCCTGGGACCCGGA 
15 AGAGAGGTCGTCCTCCAAAATACAATGCAGTGCTGGGGTTTGGAGCCCTTACCCCAACATCCCCCC 
AATCCAGTCATCCTGACTCCCCTGAAAATGAAAAGACAGAGACCACATTCACTTTCCCTGCACCTG 
TTCAGCCTGTGTCCCTGCCCAGCCCCACCTCCACAGACGGTGATATTCATGAGGATTTTTGCAGCG 
TTTGCAGAAAAAGTGGCCAGTTACTGATGTGCGAO^CATCTTCCCGTGTATATCATTTGGACTGCT 
TAGACCCCCCTCTGAAAACAATTCCCAAGGGCATGTGGATCTGTCCCAGATGTCAGGACCAGATGC 
20 TGAAGAAGGAAGAAGCAATTCCATCGCCTGGAACTTTAGCAATTGTTCATTCCTATATTGCCTA^ 
AAGCAGCy^GAAGAAGAGAAACAGAAGTTACTTAAATGGAGTTCAGATTTAAAACAAGAACGAG 
AACAACTAGAGCAAAAGGTGAAACAGCTCAGCAATTCCATAAGTAAATGCATGGAAATGAAGAACA 
CCATCCTGGCCCGGCAGAAGGAGATGCACAGCTCCCTGGAGAAGGTAAAACAGCTGATTCGCCTCA 
TCCACGGCATCGACCTCTCCAAACCTGTAGACTCTGAGGCCACTGTGGGGGCCATCTCCAATGGCC 
25 CGGACTGCACCCCCCCTGCCAATGCCGCCACCTCCACGCCGGCCCCTTCCCCCTCCTCCCAGAGCT 
GCACAGCGAACTGTAACCAGGGGGAAGAGACTAAATAACAGAGCCCCTCTAGGAGAAGCCACGGGA 
TCCCGGCGGCAAGGAGAACAGAACACTGAAGACTCTAGAAAAGCAAAGCCGGATTTCTGGAAAGTG 
CAGAATTCTTTTGGTTCTTTGGTTCCAGAGAGAGAGAAGATGCTTGTGCCAGGTGGCACCAGAGTT 
TGCCAATTGATCCTTCTTATTCTGTGTGTACATGCAAAGATTGGACCATGTTACATGAAATAGTGC 
30 CAGCTGGAGGTTCTTTCCCAGCACOVTGCCAAGTGAAATAATATATTTACTCTCTCTATTATACAC 
CAGTGTGTGCCl^GCAGCCTCCACAGCCACGATGGGTTTGTTTCTGTTTTCTTGGGTGGGGAGC 
AGGGACGGGCGGAGGGAGGAGAGCAGGTTTCAGATCCTTACTTGCCGAGCCGTTTGTTTAGGTAGA 
GAAGACAAGTCCAAAGAGTGTGTGGGCTOTCCTGTTTCTAAACTTTCGCTACTATAAAACCAAAAA 
AAGGAATTGAGATTTCACCAACCCCAGTGCCCAGAAGAGGGAAGGGGAGTGGCTGGAGGGAGCAGG 
35 GGGTGGGACAGTGTATCAAATAAGCAGTATTTAATCACCTCTGGCGGGGGCCTCGTGCAAGGGGAG 
ACTGACACCAAGAACAGCCAGTAGGTTCTTCTCCCCTGCACTCTGCTCCCTGCGCGGTAACCCC^^ 
CACTCCTGAAGCCTGCCCAGTCTCCTTCCTTCCCTGCTTGGTGAGTCGCGCATCTCCGTGGTTATC 
CCGCTGTCTCCTCTCCAAGAACAAGCAGAGCCCGGGCCACTGGCCCTTGCCCAAGGCAGGGAAGAA 
GGATGTGTGTGTCCAGGAAGGAAAAAAAGGTGGATCAGTGATTTTACTTGAAAACAAGCTCCATCC 
40 CTTTTCTATATTTATAAGAAGAGAAGATCTTGAGTGAAGCAGCACGCGACCCAGGTGTGTGTGAAT 
TGAATGGAGACGTTTCTTTTCTCTTTCTTTAATTriTGTTOT 

TTTTATTTTACTGTTCATTTTACTTTCTTGGTAACAAAAACTAAAATAAGGAATAGAAAAGCTGTO 
TTTCAGGCTGACAGTCO^TTAAGGGTAGCCAAGACCTTGCATGGTAGAGTAGGAATCATAGTC 
AGTGAGGTCCCGTGAGTCTTTGTGAGTCCTTCTGTCATCGTTCGGGCACTGTTTTTTTATGCAAGG 
45 GCAAAAATCTTTGTATCTGGGGAAAAAAAACTTTTTTTTAAATTAAAAAGGAAAATAAAAGATATT 
GAGGTCTTCCTAGTGTTACTTAAATTAAGATCAAGGTAAGAAACATTGTAAAAAAAAATTACAAAA 
GTGCTATTTGTTTCCTAAAAACAGTGATTTCTATTAAAAAGGTGTCAGAACTGG 

Seq ID 5 

50 MPLGHIMRLDLEKIAI^YIVPCLHEVGFCYIiDNFLGEWGDCVIiERVKQLHCTG^ 

GVSKRHIJIGDQITWIGGNEEGCEAISFLLSLIDRLVLYCGSRLGKYYVKERSKAMVACYPGNGTGY 

VIUIVDNPNGDGRCITCIYYUJKNWDAKIJiGGIIJlIFPEGKSFIADV^ 
QPSYATRYAMTVWYFDAEERAEAKKKFRNLTRKTESALTED 

55 Seq ID 6 
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GAGTCTGGCCGCAGTCGCGGCAGTGGTCMCTTCCCATCCCCAAAAGGCGCCCTCCGACTCCTTGCG 

CCGCACTGCTCGCCGGGCCAGTCCGGAAACGGGTCGTGGAGCTCCGCACCACTCCCGCTGGTTCCC 

GAAGGCAGATCCCTTCTCCCGAGAGTTGCGAGAAACriTrCCCTTGTCCCCGACGCTGCAGCGGCTC 

GGGTACCGTGGCAGCCGCAGGTTTCTGAACCCCGGGCCACGCTCCCCGCGCCTCGGCTTCGCGCTC 

GTGTAGATCGTTCCCTCTCTGGTTGCACGCTGGGGATCCCGGACCTCGATTCTGCGGGCGAGATGC 

CCCTGGGACACATCATGAGGCTGGACCTGGAGAAAATTGCCCTGGAGTACATCGTGCCCTGTCTGC 

ACGAGGTGGGCTTCTGCTACCTGGACAACTTCCTGGGCGAGGTGGTGGGCGACTGCGTCCTGGAGC 

GCGTCAAGCAGCTGCACTGCACCGGGGCCCTGCGGGACGGCCAGCTGGCGGGGCCGCGCGCCGGCG 

TCTCCAAGCGACACCTGCGGGGCGACCAGATCACGTGGATCGGGGGCAACGAGGAGGGCTGCGAGG 

CCATCAGCTTCCTCCTGTCCCTCATCGACAGGCTGGTCCTCTACTGCGGGAGCCGGCTGGGCAAAT 

ACTACGTCAAGGAGAGGTCTAAGGCAATGGTGGCTT6CTATCCGGGAAATGGAACAGGTTATGTTC 

GCCACGTGGACAACCCCAACGGTGATGGTCGCTGCATCACCTGCATCTACTATCTGAACAAGAATT 

GGGATGCCAAGCTACATGGTCGGATCCTGCGGATATTTCCAGAGGGGAAATCATTCATAGCAGATG 

TCGAGCCCATTTTTGACAGACTCCTGTTCTTCTGGTCA6ATCGTAGGAACCCACACGAAGTGCAGC 

CCTCTTACGCAACCAGATATGCTATGACTGTCTGGTACTTTGAlXSCTGAAGAAAGGGCAGAAGCCA 

AAAAGAAATTCAGGAATTTAACTAGGAAAACTGAATCT6CCCTCACTGAAGACTGACCGTGCTCTG 

AAATCTGCTGGCCTTGTTOVTTTTAGTAACGGTTCCTGAATTCTCTTAAATTCTTTGAGATCCAAA 

GATGGCCTCTTCAGTGACAAC^TCTCCCTGCTACTTCTTGCATCCTTCACATCCCTGTCTTGTGT 

GTGGTACOTCATGTTTTCTTGCCAAGACTGTGTTGATCTTO^GATACTCTCTTTGCCAGATG^^ 

TATTTGCTAACTCCAGAAATTCCTGCAGACATCCTACTCGGCCAGCGGTTTACCTGATAGATTCGG 

TAATACTATCAAGAGAAGAGCCTAGGAGO^CAGCGAGGGAATGAACCTTACTTGCACTTTATGT^^ 

ACTTCCTGATTTGAAAGGAGGAGGTTTGAAAAGAAAAAAATGGAGGTGGTAGATGCCACAGAGAGG 

CATCACGGAAGCCTTAACAGCAGGAAACAGAGAAATTTGTGTCATCTGAACAATTTCCAGATGTTC 

OTAATCCAGGGCTGTTGGGGTTTCTGGAGAATTATCACAACCTAATGACATTAATACCTCTAGAAA 

GGGCTGCTGTCATAGTGAACAATTTATAAGTGTCCCATGGGGCAGAOICTCCTTTTTTCCCAGTCC 

TGCAACCTGGATTTTCTGCCTCAGCTCCATTTTGCTGAAAATAATGACTTTCTGAATAAAGATGG^ 

AACACAATTTTTTCTCCATTTTCAGTTCTTACCTGGGAACCTAATTCCCCAGAAGCTAAAAAACTA 

GACATTAGTTGTTTTGGTTGCTTTGTTGGAATGGAATTTAAATTTAAATGAAAGGAAAAATATATC 

CCTGGTAGTTTTGTGTTAACCACTGATAACTGTGGAAAGAGCTAGGTCTACTGATATACAATAAAC 

ATGTGTGCATCTTGAACAATTTGAGAGGGGAGGTGGAGTTGGAAATGTGGGTGTTCCTGTTOT 

TTTTTTTTTTTTTTTTTTTTAGTTTTCCOTTTTAAO^GCTCAC^^ 

GTGATGTATTTTAAAAAAGGAAGTGGAAATAAAAAAATCTCAAAGCTATTTGAGTTCTCGTCTGTC 

CCTAGCAGTCTTTCTTCAGCTCACTTGGCTCTCTAGATCCACTGTGGTTGGCAGTATGACCAGAAT 

CATGGAACTTGCTAGAACTGTGGAAGCTTCTACTCCTGCAGTAAGCACAGATCGCACTGCCTCAAT 

AACTTGGTATTGAGCACGTATTTTGCAAAAGCTACTTTTCCTAGTTTTCAGTATTACTCT 

TTAAAAATCCCTTTAATTTCTTGCTTGAAAATCCCATGAACATTAAAGAGCCAGAAATATTTTCCT 

TTGTTATGTACGGATATATATATATATATAGTCTTCCAAGATAGAAGTTTACTTTTTCCTCTTCTG 

QTTTTGGAAAATTTCCAGATAAGACATGTCACO^TTAATTCTCAACGACTOCTCTATrrTG 

CGGTAATAGTTATCACCTTCTAAATTACTATGTAATTTACTCACTTATTATGTTTATTGTCTTGTA 

TCCTTTCTCTGGAGTGTAAGCACAATGAAGACAGGAATTTTGTATATTTTTAACCAATGCAACATA 

CTCTCAGCACCTAAAATAGTGCCGGGAACATAGTAAGGGCTCAGTAAATACTTGTTGAATAAACTC 

AGTCTCCTACATTAGCATTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 

Sag ID 7 

MACGDHFFVCVQRHTGFAFNDLLWLEWTIKTNWLLDKLYITRSSI^ 
STSAWPPCLGSEEPRFPGAAPSGK 

Seq ID 8 

CTCTTGACCAGAAGATTTCTTTACCTACCATTTCATTGGTAGTATGGCCl^ 

TGTGTGTGTACAGCGTCATACAGGCTTTGCCTTTAATGATCTCTTATGGTOAGAAAAC^^ 

AACAAACTGGCTACTGGACAAATTGTATATTACCAGATCATCACTAGCAGCTTTCAGTTGCACTTC 

AGTCCTTTATCAAATTCATAAAGAATTGTTCTGGGAAGTGAG6AGCACCTCTGCCTGGCCACCCTC 

TCTGGGAAGTGAGGAGCCCCGCTTCCCGGGAGCrGCXrCCATCTGGGAAGTGAGGAGCGCTTGCCTG 

GCCGCCCAACTGTCTGGGAAGTGAGGTGTGCCTGTGCCTGGCTGCC06CTTCACTCCATCAAACAC 

GGCTGGGAGTTCGGGAGTCTCCAGCATGGCATTaiTCTTGATCAATGTTACTTATTCOT^ 
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CAAATACGAATTNTCCTTGAAGAGTTGTACAGNTGGGCCGGGCAGGTCAGATCTCTTArrCAAACC 
AACCTAAAAGCTGCGTCTGATTGACTACAGGTGCTCAGCTCGGCCCCAGTCTGGTTTCTGCAGCTT 

CCCG 

Seq ID 10 ^ 

TTAATGTCGAAAATGCAAACTTGGGGAGGGCAGAAAGATCACACACAAGGCTGTCACTTCATACTT 

GCAGGATTGCACAGCAGCCGGGCAGAGGCGCTCCTCACTTCCCAGATGGGGCGGCGGGCANCAGAG 
ACGCACCTCACTTCCTAGACAGTGCGGCANCATGGCACAGGCACACCTCACTTCCCAGACAGTTGG 
GCGGCCAGGCAAGCGCTCCTCACTTCCCA6ATGGGGCGGCTCCCGGGAAGCGGGGCTCCTCACTTC 
CCAGACAGGGTGGCCAGGCAGAGGTGCTCCTCACTTCCCAGAACAATTCTTTATGAATTTGATAAA 
GGACTQAAGTGCAACTGAAAGCTGCTAGTGATGATCTGGTAATATACAATTTGTCCAGTAGCCAGT 
,p^,j«PTTTATTGTGTTTTCTAACCATAAGAGATCATTAAAGGCAAAGCTGTTATGACG 

Seq ID 11 

MLAVGCALLAALLAAPGAAIJ^RRCPAQEVARGVLTSLPGDSVTLTCPGVEPEDNATVHW^^ 

AGSHPSRWAGMGRRLLLRSVQLHDSGNYSCYRAGRPAGTVHLLVDVPPEEPQLSCFRKSPLSNWC 

EWGPRSTPSLTTKAVLLVRKFQNSPAEDFQEPCQYSQESQKFSCQLAVPEGDSSFYIVSMCVASSV 

GSKPSKTQTFQGCGILQPDPPANITVTAVAimPRWLSVTWQDPHSWNSSFYRLRFBLRYRAERSKT 

FTTWMVia)LQHHCVIHDAWSGLRHWQIJlAQEEFGQGEWSEWSPEAMGTPVfrESRSPPAENEVSTP 

MQALTimDDDmLFRDSANATSLPVQDSSSVPLPTFLVAGGSIJ^Gl^LCIAIVLRFKKTWKLRA 

LKEGKTSMHPPYSLGQLVPERPRPTPVLVPLISPPVSPSSLGSDNTSSHNRPDARDPRSPYDISNT 

DYFFPR 
Seq ID 12 

GGCGGTCCCCTGTTCTCCCCGCTCAGGTGCGGCGCTGTGGCAGGAAGCCACCCCCTCGGTCGGCCG 

GTGCGCGGGGCTGTTGCGCCATCCGCTCCGGCTTTCGTAACCGCACCCTGGGACGGCCCAGAGACG 

CTCCAGCGCGAGTTCCTCAAATGTTTTCCTGCGTTGCCAGGACCGTCCGCCGCTCTGAGTCATGTG 

CGAGTGGGAAGTCGCACTGACACTOAGCCGGGCCAGAGGGAGAGGAGCCGAGCGCGGCGCGGGGCC 

GAGGGACTCGCAGTGTGTGTAGAGAGCCGGGCTCCTGCGGATGGGGGCTGCCCCCGGGGCCTGAGC 

CCGCCTGCCCGCCCACCGCCCCGCCCCGCCCCTGCCACCCCTGCCGCCCGGTTCCCATTAGCCTGT 

CCGCCTCTGCGGGACCATGGAGTGGTAGCCGAGGAGGAAGCATGCTGGCCGTCGGCTGCGCGCTGC 

TGGCTGCCCTGCTGGCCGCGCCGGGAGCGGCGCTGGCCCCAAGGCGCTGCCCTGCGCAGGAGGTGG 

CAAGAGGCGTGCTGACCAGTCTGCCAGGAGACAGCGTGACTCTGACCTGCCCGGGGGTAGAGCCGG 

AAGACAATGCCACTGTTCACTGGGTGCTCAGGAAGCCGGCTGCAGGCTCCCACCCCAGCAGATGGG 

CTGGCATGGGAAGGAGGCTGCTGCTGAGGTCGGTGCAGCTCCACGACTCTGGAAACTATTCATGCT 

ACCGGGCCGGCCGCCCAGCTGGGACTGTGCACTTGCTGGTGGATGTTCCCCCCGAGGAGCCCCAGC 

TCTCCTGCTTCCGGAAGAGCCCCCTCAGCAATGTTGTTTGTGAGTGGGGTCCTCGGAGCACCCCAT 

CCCTGACGACAAAGGCTGTGCTCTTGGTGAG6AAGTTTCAGAACAGTCCGGCCGAAGACTTCCAGG 

AGCCGTGCCAGTATTCCCAGGAGTCCCAGAAGTTCTCCTGCCAGTTAGCAGTCCCGGAGGGAGACA 

GCTCTTTCTACATAGTGTCCATGTGCGTCGCCAGTAGTGTCGGGAGCAAGTTCAGCAAAACTCAAA 

CCTTTCAGGGTTGTGGAATCTTGCAGCCTGATCCGCCTGCCAACATCACAGTCACTGCCGTGGCCA 

GAAACCCCCGCTGGCTCAGTGTCACCTGGCAAGACCCCCACTCCTGGAACTCATCTTTCTACAGAC 

TACGGTTTGAGCTCAGATATCGGGCTGAACGGTCAAAGACATTCACAACATGGATGGTCAAGGACC 

TCCAGCATCACTGTGTCATCCACGACGCCTGGAGCGGCCTGAGGCACGTGGTGCAGCTTCGTGCCC 

AGGAGGAGTTCGGGCAAGGCGAGTGGAGCGAGTGGAGCCCGGAGGCCATGGGCACGCCTTGGACAG 

AATCCAGGAGTCCTCCAGCT6AGAACGAGGTGTCCACCCCCATGCAGGCACTTACTACTAATAAAG 

ACGATGATAATATTCTCTTCAGAGATTCTGCAAATGCGACAAGCCTCCCAGTGCAAGATTCTTCTT 

CAGTACCACTGCCCACATTCCTGGTTGCTGGAGGGAGCCTGGCCTTCGGAACGCTCCTCTGCATTG 

CCATTGTTCTGAGGTTCAAGAAGACGTGGAA6CTGCGGGCTCTGAAGGAAGGCAAGACAAGCATGC 

ATCCGCCGTACTCTTTGGGGCAGCTGGTCCCGGAGAGGCCTCGACCCACCCCAGTGCTTGTTCCTC 

TCATCTCCCCACCGGTGTCCCCCAGCAGCCTGGGGTCTGACAATACCTCGAGCCACAACCGACCAG 

ATGCCAGGGACCCACGGAGCCCTTATGACATCAGCAATACAGACTACTTCTTCCCCAGATAGCTGG 

CTGGGTGGCACCAGCAGCCTGGACCCTGTGGATGACAAAACACAAACGGGCTCAGCAAAAGATGCT 

TCTCACTGCCATGCCAGCTTATCTCAGGGGTGTGCGGCCTTTGGCTTCACGGAAGAGCCTTGCGGA 

AGGTTCTACGCCAGGGGAAAATCAGCCTGCTCCAGCTGTTCAGCTGGTTGAGGTTTCAAACCTCCC 
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TTTCCAAATGCCCAGCTTAAAGGGGTTAGAGTGAACTTGGGCCACTGTGAAGAGAACCATATC 
ACTCTTTGGACACTCACACGGACACTCAAAAGCTGGGCAGGTTGGTGGGGGCCTCGGTG 
GCGGCTCGCAGCCCACCCCTCAACACCTCTGCACAAGCTGCACCCTCAGGCAGGTG^ 
CCAGCCAAAGCCTCCTCCAGCCGCCATGCTCCTGGCCCACTGCATCGTTTCATCTTCCAACTCA^ 
5 CTCTTAAAACCCAAGTGCCCTTAGCAAATTCTGTTTTTCTAGGCCTGGGGACGGCTTOT 

CGCCAAGGCCTGGGGGAAGAAGCTCTCTCCTCCCTTTCTTCCCTACAGTTCAAAAACAGCTGAGGG 
TGAGTGGGTGAATAATACAGTATGTCAGGGCCTGGTCGTTTTCAACAGAATTATAAOT^ 
ATTAGCAGTTTTGCCTAAATGTGAATGATGATCCTAGGCATTTGCTGAATACAGAGGCAACTGC^^ 
TGGCTTTGGGTTGCAGGACCTCAGGTGAGAAGCAGAGGAAGGAGAGGAGAGGGGCACAGGGTC 

10 ACCATCCCCTGTAGAGTGGGAGCTGAGTGGGGGATCACAGCCTCTGAAAACCAATGTTC^^ 

TCCACCTCCCACAAAGGAGAGCTAGCAGCAGGGAGGGCTTCTGCCATTTCTGAGATCAAAAC^ 
TTACTGCAGCTTTGTTTGTTGTCAGCTGAACCTGGGTAACTAGGGAAGATAATATTAAGGAAGA 
ATGTGAAAAGAAAAATGAGCCTGGCAAGAATGCGTTTAAACTTGGTTTTTAAAAAACTGC^^ 
TTTTCTCTTGAGAGGGTGGAATATCCAATATTCGCTGTGTCAGCATAGAAGTAACTO^ 

15 TGGGGGAAGCACCATAACTTTGTTTAGCCCAAAACCAAGTCAAGTGAAAAAGGAGGAAGAGAA 
ATATTTTCCTGCCAGGCATGGAGGCCCACGCACTTCGGGAGGTCGAGGCAGGAGGATCACTTC^ 
CCAGAAGTTTGAGATCAGCCTGGGCAATGTGATAAAACCCCATCTCTACAAAAAGCATAAAAATTA 
GCCAAGTGTGGTAGAGTGTGCCTGAAGTCCCAGATACTTGGGGGGCTGAGGTGGGAGGATCTCTTG 
AGCCTGGGAGGTCAAGGCTGCAGTGAGCCGAGATTGCACCACTGCACTCCAGCCTGGGGTGAC^^ 

20 GCAAGTGAGACCCTGTCTC 
Seq ID 13 

MEGLVMFQDLSIDFSQEEWECLDAAQKDLYIUDVMMENYSSLVSIj^^ 
SRDVLGGWCRDVFISTQQasrAMFLLSGYFSDSEFRCKTKDSCLPKEIYEVTSSQW^^ 
25 SSVRDDI^nSCKGQFQHQDINQiaiYLEKAIMTYETTPTFCLQTSLTI^RIHPGE^ 

GSELTQQQETHTGEKLYKCKECGKAFHHFSYLVKHQRIHTGEKPCACKEYGKAFISGS^ 

YTDERPHECQESVKAFRPSAHLIQHVmiHTGDKPYECKECGKSFTSGSTLNQHQQIHTGEK^ 

QCGKSFTVGSTLIRHQQIHTGEKPYDCKECGKSFASGSALIRHQRIHTGEKPYDCKECGKSFTFHS 

alirhqrihtgekpyix:kecgksftfrsglighqaihtgekpyix;kecgksftagstliqhqriht 

30 GEKPYDCKECGKSFASGSALLQHQRIHTGEKPYCCKECGKSFTFRSTRimHQRIHTGEKPYNCKEC 
GKSFASGSALLQHQRIHTGEKPYHCKECGKSFTFRSGLIGHQAXn^TGEKPYIXnCECGKSFTSRSAL 
IQHQRIHTGEKPYHCKECGKSFTVGSTLLQHQQIHTGEKPYDCKECGKAFRLRLRLTQHQQIHTGE 
KPYQCQECGKAFVSVSGLTQHHRIHTGEKPYECPDCGKAFRQRTYmQHRRIHTGEKPYE^^ 
SFTFCSGLIQHQQiraTDEKPYDGKECGKSFTSHSTLIQHQQIHTGEKPYDCKECGKSFTSHSTLIQ 

35 HQQIHTGEKLYDCKECGKSFTSHSTLIQHQPLHTGEKPYHCKECGKSPniRSALIQHRPVHTC 

YSCKECGKSFTSRSTLIEHQRIHTGEKPYHCKECGKSFAFRSAIIQHRRIHTGEKPYDCKECGKAF 
RRRSKLTQHQRIHTGEKPYRCHECGKAFVRFSGLTKHHSIHTGEKPYEOCTCGKSFRQRTHLTLHQ 
RIHTGDRPYECKECGKSFTCGSELIRHQRTHTGEKPYIDCKECGKAFRCPSQLSQHKRIHTGEKTYQ 
CPECGKAFFYASGLSRHQSVHTGEKPYECKTCGKAFKQLTQLTRHQRIHDLT 

40 

Seg ID 14 

ATGGAGGGGTTGGTCATGTTCCAGGATCTGTCTATAGACTTCTCTCAGGAGGAATGGGAGTCCCTO 
GACGCTGCTCAGAAGGACTTATACAGAGATGTAATGATGGAGAACTACAGCAGCCTGGTCTCACTA 
GGTCTCTCTATCCCAAAGCCTGATGTGATTTCCOTACTGGAGCAAGGGAAAGAGCCCT^ 

45 TCAAGG6ACGTGCTGGGAGGATGGTGCCGAGATGTTTTTATTTCTACTCAACAAGG 
TTTTTGCTTTCTGGTTATTTTTCAGACTCGGAGTTTAGATGTAAGACCAAAGAT^ 
AAAGAAATCTATGAAGTAACATCATCTCAGTGGGTGAGAATGGAAAAATGTCT^TAGCC^^ 
TCCAGTGTCAGAGATGACTGGGAATGCAAAGGCCAGTTTCAGCACCAAGATATAAATCAG^ 
TATTTGGAAAAAGCAATAATGACTTATGAAACAACACCAACTTTCTGCC^^ 

50 CTGCATCATCGGATTCATCCTGGAGAGAAACTGTATAAATCCACAGAATGTATGGCTTTTAAGTAT 
GGCTCAGAACTTACACAACAGCAAGAAACCCATACTGGTGAAAAACTCTATAAATC^ 
GGGAAGGCCTTTCATCACTTTTCCTATCTTGTTAAAC^^ 
TGTGCATGTAAAGAATATGGAAAAGCTTTTATTTCTGGCTCACT^TCTTAOT 
TATACAGATGAGAGACCTCATGAGTGTCAGGAATCCGTGAAGGCCTTTAGACCGTCTGCACATCTO 

55 ATTCAACATTGGAGAATTCATACTGGTGACAAACCTTATGAATGTAAAGAAOXSTGGGA^ 
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ACTTCTGGCTCAACACTAAATCAACATCAGCAAATTCACACTGGTGAGAAACCCTATC^^ 
CAATGTGGAAAATCTTTTACTGTTGGCTCAACACTAATTCGACACCAGCAAATT^ 
AAACCGTATGATTGTAAGGAATGTGGAAAATCTTTTGCTTCAGGCTCAGCACTAATTCGACAT^ 
CGAATTCACACTGGTGAGAAACCCTATGACTGTAAGGAATGTGGAAAATCTTTTACTTTTCACT 
5 GCTCTAATTCGACATCAGCGAATTCACACTGGTGAGAAACCCTATGATTGTAAGGAATGTGGGAAA 
TCTTTTACTTTTCGCTCAGGGTTAATTGGACACCAGGCAATTCACACTGGTGAGAAAGCCT^ 
TGTAAAGAATGTGGAAAATCTTTTACTGCTGGCTCAACACTAATTCAACATCAGCG^ 
GGTGAGAAACCCTATGATTGTAAGGAGTGTGGGAAATCTTTTGCTTCGGGCTCAGCACTACTTC^ 
CATCAGCGGATTCACACTGGTGAGAAACCCTATTGTTGTAAGGAATGTGGAAAATCTm 

10 CGCTCAACACGCAATCGACACCAGCGAATCCACACTGGTGAGAAACCCTATAATTGTAAAGAATGT 
GGAAAATCTTTTGCTTCTGGCTCAGCACTACTTCAACATCAGCGAATTCACACTGGTGA 
TATCATTGTAAGGAATGTGGAAAATCTTTTACTTTTCGCTCAGGGCTAATTGGACATC^^ 
CACACTGGTGAGAAACCCTATGATTGTAAAGAATGTGGAAAATCTTTOACTTC^ 
ATTCAACATCAGCGAATOCACACTGGTGAGAAACGCTATCACTC 

15 ACTGTTGGCTCAACACTACTTCAACATCAGCAAATTCACACTGGTGAGAAACCT^^ 
GAATGTGGGAAGGCCTTTAGACTTCGTTTACGACTTACTCAACATCAACAAATC 
AAACCTTATCAATGTCAGGAATGTGGGAAAGCCTTTGTCAGTGTCTCAGGACTCACCCAACAT^ 
AGAATTCACACTGGTGAGAAACCCTATGAATGTCCGGACTGTGGGAAGGCCTTTAGACAGCGTACA 
TACCTTAATCAACATCGGAGAATTCATACTGGTGAGAAACCOTATGAATGTAAAGAATGTGGGAAA 

20 TCTTOTACTTTCTGCTCAGGACTAATTCTVACATCAGCAAAATCACACT^ 
GGTAAGGAATGTGGGAAATCTTTTACTTCTCACTCAACACTAATT^ 
GGTGAGAAACCCTATGATTGTAAGGAATGTGGGAAATCTTTTACTTCTCACTCAACAC^^ 
CATCAGCAAATACACACTGGTGAGAAGCTCTATGATTGTAAGGAATGTGGGAAATCTTTOACTTC 
CATTCAACACTAATTCAACATCAGCCACTTCACACTGGTGAGAAACCTTATCATTGC^ 

25 GGGAAATCTTTTACTCTTCGCTCAGCACTAATTCAACATCGGCCAGTTCACACTGGTGAGAAACG 
TACAGTTGTAAAGAATGTGGGAAATCTTTTACTTCTCGCTCAACACTAATTGAAC^^ 
CACACTGGTGAGAAACCCTATCATTGTAAGGAATGTGGAAAATCTTTTGCTTTTC 
ATCCAACATCGGCGAATTCACACTGGTGAGAAACCCTATGATTGTAAGGAATGTGGCAAGGCGTTT 
AGACGTCGTTCAAAACTTACTCAACATCAACGAATCCATACTGGTGAGAAACCTTATCGATGT^^ 

30 GAATGTGGAAAAGCCTTTGTCCGTTTCTCAGGACTCACCAAACATCACAGTATTCAC^ 

AAACCCTATGAATGTAAGACATGTGGGAAGTCCTTCAGACAGCGTACACACCTTACTCTACATCAG 
AGAATTCATACCGGTGACAGACCTTATGAATGTAAAGAATGTGGGAAATCTTTTACTTGCGGCTC 
GAACTAATTCGACATCAGCGAACTCACACTGGTGAGAAACCTTATGATTGTAAGGAATGTGGGAAG 
GCCTTTAGATGTCCTTCACAACTTAGTCAACATAAACGAATCCATACTGGTGAGAAAACTTATCAA 

35 TGTCCGGAATGTGGGAAAGCCTTTTTCTATGCCTCAGGACTCAGCCGACATCAGAGTGTTC^ 

GGCGAGAAACCCTATGAATGTAAGACATGTGGGAAGGCCTTTAAACAGCTTACACA6CTTACTCGA 

CATCAGAGAATTCATGACCTAACATAA 
Seq ID 18 

40 GATCTTCAGTTCCTATTTTACTATAAGCCTTTGAAATGTTAAGGCTGAATAGACCAAGATATGAA^ 
AGGGACAAGAGGAAGTAAGAACAGTGCAGATACAGTGGGAACACACGGTAACTTTCTGCCTTCTC 
ATTCTACGTGGACGGTGTGCTTTCCGGGCAGCAGGTGOTGTGACTCTAAAAAGATCACAC^^ 
CAGTAACTATGTAGGATTTTTGGCACTGTTAAGAGCGTANATimATAAAGTC^ 
AAATTCCCTCAACCAAATTTTTGATAAGTCTGAAATTAAAAAAAAATAATTTOT 

45 ATTCTTGGGTTCTCCNTTTCAGCCCTGCCTTTTAGGAAAACTTACATCCTTTAAAAAACTT^ 
AAAAAGGNTTCCOTTCACNGCAGGNGGTGGACTTNGGG 

Seq ID 19 

MYQDYPGNFDTSSRGSSGSPAHAESYSSGGGGQQKFRVDMPGSGSAFIPTINA^ 
50 TVITSMSNPYPRSHPYSPLPGIASVPGHMALPRPGVIKTIGTTVGRERimEQLSP^^ 
RNKIAAAKCRimRRELTEKLQAETEELEEEKSGLQKEIAELQKEK^ 

RSPPAPGLQPimSGGGSVGAVVVKQEPLEEDSPSSSSAGLDKAQRSVIKPISIAGGFYGEEPLHTP 
IVVTSTPAVTPGTSITLVFTYPSVLEQESPASPSESCSKAHRRSSSSGDQSSDSLNSPTLL^ 

55 Seq ID 20 
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GGCGCTCGCAGAGCCGGAAAGAAGTGCTGTAAGGGACGCTCGGGGGACGCTGTTCCTGAGGTGTCG 

CCGCCTCCCTGTCCTCGCCCTCCGCGGTGGGGGAGAAACCCAGGAGCC3AAGCCCAGAGCCCGCGGC 

GCGGCCGGCGGACGAACGAGCGCGCAGCAGCCGGTGCGCGGCCGCGGCGAGGGCGGGGGAAGAAAA 

ACACCCTGTTTCCTCTCCGGCCCCCACCGCGGATCATGTACCAGGATTATCCCGGGAACTTTGACA 

CCTCGTCCGGGGGCAGCAGCGGCTCTCCTGCGCACGCCGAGTCCTACTCCAGCGGCGGCGGCGGCC 

AGCAGAAATTCCGGGTAGATATGCCTGGCTCAGGCAGTGCATTCATCCCCACCATCAACGCCATCA 

CGACCAGCCAGGACCTGCAGTGGATGGTGCAGCCCACAGTGATCACCTCCATGTCCAACCCATACC 

CTCGCTCGCACCCCTACAGCCCCCTGCCGGGCCTGGCCTCTGTCCCTGGACACATGGCCCTCCCAA 

GACCTGGCGTGATCAAGACCATTGGCACCACCGTGGGCCGCAGGAGGAGAGATGAGCAGCTGTCTC 

CTGAAGAGGAGGAGAAGCGTCGCATCCGGCGGGAGAGGAACAAGCTGGCTGCAGCCAAGTGCCGGA 

ACCGACGCCGGGAGCTGACAGAGAAGCTGCAGGCGGAGACAGAGGAGCTGGAGGAGGAGAAGTCAG 

GCCTGCAGAAGGAGATTGCTGAGCTGCAGAAG6AGAAGGAGAAGCTGGAGTTCATGTTGGTGGCTC 

ACGGCCCAGTGTGCAAGATTAGCCCCGAGGAGCGCCGATCGCCCX:CAGCCCCTGGGCTGCAGCCCA 

TGCGCAGTGGGGGTGGCTCGGTGGGCGCTGTAGTGGTGAAACAGGAGCCCCTGGAAGAGGACAGCC 

CCTCGTCCTCGTCGGCGGGGCTGGACTAGGCCCAGCGCTCTGTCATCAAGCCO^TCAGCATTGCTG 

GGGGCTTCTACGGTGAGGAGCCCCTGCACACCCCCATCGTGGTGACCTCCACACCTGCTGTCACTC 

CGGGCACCTCGAACCTCGTCTTCACCTATCCTAGCGTCCTGGAGCAGGAGTCACCCGCATCTCCCT 

CCGAATCCTGCTCCAAGGCTCACCGCAGAAGCAGTAGCAGCGGGGACCa^TCATCAGACTCCTTGA 

ACTCCCCCACTCTGCTGGCTCTGTAACCCAGTGCACCTCCCTCCCCAGCTCCGGAGGGGGTCCTCC 

TCGCTCCTCCTTCCCAGGGACCAGCACCTTCAAGCGCTCCAGGGCCGTGAGGGCAAGAGGGGGACC 

TGCCACCAGGGAGCTTCCTGGCTCTGGGGGACCCAGGTGGGACTTAGCAGTGAGTATTGGAAGACT 

TGGGrTGATCTCTTAGAAGCCATGGGACCTCCTCCCTCATTCATCTTGCyAGCAAATCCCATTTCT 

TGAAAAGCCTTGGAGAACTCGGTTTGGTAGACTTGGACATCTCTCTGGCTTCTGAAGAGC.CTGAAG 

CTGGCCTGGACCATTCCTGTCCCrrTGTTACCATACTGTCTCTGGAGTGATGGTGTCCTTCCCTGC 

CCCACCACGCATGCTCAGTGCCTTTTGGTTTCa^CCTTCCCTCGACTTGACCCTTTCCTCC^^ 

GTCAGTTTCACTCCCTCTTGGTTTTTATCy^AATTTGCCATGACATTTCATC 

TTAAAGCTCTTCATTTCTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Seg ID 21 

MMKKSADHiaJLEVIVTNGYDKDGFVQDIQNDIHASSSLNGRSTEEVKPIDENLGQTGKSAVCIHQD 
INDDHVEDVTGIQHLTSDSDSEVYCDSMEQFGQEESLDSFTSNNGPFQYYLGGHSSQPMENSGFRE 
DIQVPPGNGNIGIMQWAVEGKGEVKHGGEDGRmSGAPHREKRGGETDEFSNVRRGRGHEMQHLS 
EGTKGRQVGSGGIXSERWGSDRGSRGSIjNEQIALVLMRLQEDMQNVLQRLQKLETLTALQAKSSTST 
LQTAPQPTSQRPSWWPFEMSPGVLTFAIIWPFIAQWLVYLYYQRRKRKLN 

Sea ID 22 

cgctccatgtatnagtttcatgcaggctcttgggaaagctggtgctgctgctgcctgattcccgcc 

gacagaccttgggaccggggccaacactggcagctggagatggcggacacgagatccgtgcacgag 

actaggtttgaggcggccgtgaaggtgatccagagtttgccgaagaatgattcattccagccaaca 

aatgaaatgatgcttaaattttatagcttctataagcaggcaactgaaggaccctgtaaactttca 

aggcctggattttgggatcctattggaagatataaatgggatgcttggagttcactgggtgatatg 

accaaagaggaagccatgattgcatatgttgaagaaatgaaaaagattattgaaactatgccaatg 

actgagaaagttgaagaattgctgcgtgtcataggtccattttatgaaattgtcgaggacaaaaag 

agtggcaggagttctgatataacctcagtccgactggagaaaatctctaaatgtttagaagatctt 

ggtaatgttctcacttctactccaaacgccaaaaccgttaatggtaaagctgaaagcagtgacagt 

ggagccgagtctgaggaagaagaggcccaagaagaagtgaaaggagcagaacaaagtgataatgat 

aagaaaatgatgaagaagtcagcagaccataagaatttggaagtcattgtcactaatggctatgat 

aaagatggctttgttcaggatatacagaatgacattcatgccagttcttccctgaatggcagaagc 

actgaagaagtaaagcccattgatgaaaacttggggcaaactggaaaatctgctgtttgcattcac 

caagatataaatgatgatcatgttgaagatgttacaggaattcagcatttgacaagcgattcagac 

agtgaagtttactgtgattctatggaacaatttggacaagaagagtctttagacagctttacgtcc 

aacaatggaccatttcagtattacttgggtggtcattccagtcaacccatggaaaattctggattt 

cgtgaagatattcaagtacctcctggaaatggcaacattgggaatatgcaggtggttgcagttgaa 

ggaaaaggtgaagtcaagcatggaggagaagatggcaggaataacagcggagcaccacaccgggag 

aagcgaggcggagaaactgacgaattctctaatgttagaagaggaagaggacataggatgcaacac 
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ttgagcgaaggaaccaagggccggcaggtgggaagtggaggtgatggggagcgctggggctccgac 
agagggtcccgaggcagcctcaatgagcagatcgccctcgtgctgatgagactgcaggaggacatg 
cagaatgtccttcagagactgcagaaactggaaacgctgactgctttgcaggcaaaatcatcaaca 
tcaacattgcagactgctcctcagcccacctcacagagaccatcttggtggcccttcgagatgtct 
5 cctggtgtgctaacgtttgccatcatatggccttttattgcacagtggttggtgtatttatactat 
caaagaaggagaagaaaactgaactgaggaaaatggtgttttcctcaagaagactactggaactgg 
atgacctcagaatgaactggattgtggtgttcacaagaaaatcttagtttgtgatgattacattgc 
tttttgttgtccagtagtttagtttgtgtacatatatacacatatatattttgcactacacaaacg 
ataacattttaaggactaatattgctgatacttgaataatcaatctctactaggttataagtagta 
10 tacacagatttaccctgcccttgaacttgaaggacattaaattattaatgatcatttggtaacatg 
tttacctgattatcttccatagagtaacataagctgcttttcaaaggtaccattgtgataatgaga 
• tcaaatttataagttattatttttaattttctaaattaaataaaagaaagaatgcaaaaaaaaaaa 

aaaaaaaaaa 

15 Seq ID 23 

MSVDEKPDSEtm/YESTVHCTNILLGLN^ 

VGQTKOTLWSLEEEVTT^GFGPLl^FAYTAKLLLSRENIREVIRCA^ 

LmSEDGLFVCRKDAACQRPHEDCENSAGEEEDEEEEim)SETAKl!lACP^^ 

VAEKEEALLPEPDVPTDTKESSEKDALTQYPRYKKYQIaACTKNVYNk 

20 SLKPGLARGQIKSEPPSEENEEESITLCLSGDEPDAKDRAGDVEMDRKQPSPAPTPTAPAGAAC 
RSRSVASPSCLRSLFSITKSVELSGLPSTSQQHFARSPACPFDKGITQGDLKTDYTPFTGNYGQPH 
VGQKEVSNFTMGSPLRGPGLEALCKQEGELDRRSVIFSSSACDQVSTSVHSYSGVSSLDKDLSEPV 
PKGLWGAGQSLPSSQAYSHGGLMMDHLPGRMRPNTSCPVPIKVCPRSPPLETRTRTSSSCSSYSY 
AEDGSGGSPCSLPLCEFSSSPCSQGARFLATEHQEPGIiMGDCay[YNQVRPQIKCEQSYGTO^ 

25 SFSEADSESCF^QDRGQEVKLPFPVDQITDLPRiroFQMMIKMHKLTSEQLEFIHD\^^ 
QRCRKRKLIX:iQNLECEIRKLVCEKEKLLSERNQIJCACMGELLDNFSCLSQEVC^^ 
HRYCPVLRPMDLPTASSINPAPLGAEQNIAASQCAVGENVPCCLEPGAAPPGPPWAPSNTS 
GRRLEGTDPGTFSERGPPLEPRSQT\mroFCQEMTDKCTTDEQPRKDYT 

30 Seq ID 24 

GCGGCCGCCGCCCTCGCCACCCCGCCTGCCCACTCCCGCCGCCGCCCCGCTCTCGCTOTCCCCCCG 
GCCTCCCCTCGCCCCTTCCCCTCCCCCTTCCCGGCGCACTCGGGGGGCTGGGAACGAGCTGCCATG 
TGATGCGCGTCCCCTCCGCGAGCTTTCGGTGACCCACGAACTGCCCACCTCGCCGGCTGCCGGGAG 
GGGGCTGCGAGCCGGGAAGACGCGGGGAAGAGGAGGCGGAAAAGGACGCAAAGTTCTCCGGCGAGC 

35 GCATTCATTCACATAGCTCCCAGTTTTAACATraCGCCACCTACTGAAGACATCATTO^ 
GCTGATGAGCCCTTGAAGCACAGCAGATAAGAGTGTTGCTGTTGATCATCTTTGCCTC^ 
AATGATAAAGCCAGAAGAAAGCATGCTTTCTGATCATTTGCAGCTGTCTGCTTCAGAAAGTG^ 
CTCCAGGAATGAGGAGAATCTTCAAGAACTTCCTCGCACTGTGACATGTCTGATCCCTTC^ 
TCCCTGCAGCATGAACAAGGTGGACACTCACTGACCTGTCACAAGGTTGCCCC^^ 

40 GTCCATGTCTGAATCGATTGCCAGAGCCTTCTCATCTCTCCCTTCGCCCAGTTCCCTGCATC 
GACTCGAAGGCAGCACAGGACCTGGAAAAATTACy^TGGTGTGAACGGCATG^ 
CCTGACTCCCCCATGTATGTGTATGAGTCCACAGTCCACTGCACCAACATCCTCCTGGGCCT^ 
GACCAGCGGAAAAAGGATATTCTCTGTGACGTGACTTTGATCGTGGAGAGGAAGGAGTTCCGGGCC 
CACCGGGCTGTGCTGGCCGCATGCAGTGAATATTTTTGGCAGGCGCTGGTTGGACAGACAAAAAAT 

45 GATTTGGTGGTCAGCTTGCCTGAGGAGGTCACAGCCAGGGGCTTTGGGCCGCTGT^^ 

TACACTGCCAAGCTGTTACTCAGCAGAGAAAACATCCGCGAGGTCATCCGCTGTGCTGAGTTCCTG 
CGCATGCACAACCTGGAGGACTCCTGCTTCAGCTTCCTGCAGACCGAGCTCCTGAA^^^ 
GGCCTGTTTGTGTGCCGGAAGGATGCTGOSTGCCAGCGCCCACACGAGGACTGCGAGAACTC 
GGAGAGGAGGAGGATGAAGAGGAGGAGACGATGGATTCAGAGACGGCCAAGATGGCTTGCTC^ 

50 GACCAGATGCTTCCAGAGCCCTVTCAGCTTTGAGGCCGCCGCCATCCCCGTAGCAGAGAAGGAAGAA 
GCCCTGCTGCCCGAGCCTGACGTGCCCACAGACACCAAGGAGAGCTCAGAAAAGGACGCGTTAACG 
CAGTACCCCAGATACTAGAAATACCAGCTTGCATGTACCAAGAATGTCTATAATC 
AGTACCTGT^GGTTTTGCAAGCACATTCaSGGAAGATAACTCTAGO^ 

GCCAGGGGGCAGATTAAAAGTGAGCCGCCCAGTGAAGAGAATGAGGAAGAGAGCATCACGCTC 
55 CTGTCTGGAGATGAGCCTGACGCCAAGGACAGAGCGGGGGATGTCGAGATGGACCGGAAACAGCCC 
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AGCCCTGCCCCTACCCCCACGGCCCCAGCTGGGGCCGCCTGCCTGGAGAGATCCAGGAGCGTGGCC 

TCGCCCTCCTGCTTAAGGTCTCTGTTCAGCATAACGAAAAGTGTGGAGCTGTCTGGCCTGCCCAGT 

ACATCTCAGCAGCACTTTGCCAGGAGTCCAGCCTGCCCTTTTGACAAGGGGATCACTCAGGGTGAC 

CTTAAAACTGACTACACCCCTTTCACAGGGAATTATGGACAGCCCCACGTGGGCCAGAAGGAGGTG 

TCCAACTTCACCATGGGGTCGCCCCTCAGGGGGCCTGGGTTGGAGGCTCTCTGTAAACAGGAGGGA 

GAGCTX3GACCGGAGGAGCGTGATCTTCTCCTCCAGCGCTTGTGACCAAGTGAGCACCTCGGTGCAT 

TCTTATTCTGGGGTGAGCAGTTTGGACAAAGACCTCTCTGAGCCGGTGCCAAAGGGTCTGTGGGTG 

GGAGCCGGCCAGTCCCTCCCCAGCTCGCAGGCCTACTCCCACGGTGGGCTGATGGCCGACCACTTG 

CCAGGAAGGATGCGGCCCAACACCAGCTGCCCGGTGCCAATCAAAGTCTGCCCTCGCTCACCCCCC 

TTGGAGACCAGGACCAGGACTTCCAGCTCCTGCTCTTCCTATTCCTACGCGGAGGACGGGAGCGGG 

GGCTCACCCTGCAGCCTCCCTCTCTGTGAGTTCTCCTCCTCGCCCTGTTCCCAGGGAGCCAGATTC 

CTTGCCACAGAACATCAGGAACCAGGCCTGATGGGAGATGGAATGTACAACCAAGTGCGGCCCCAA 

ATTAAATGTGAGCAGTCTTATGGAACCAACTCCAGTGACGAATCCGGATCGTTCTCGGAAGCAGAC 

AGTGAGTCGTGTCCTGTGCAGGACAGGGGCCAGGAGGTAAAACTTCCTTTTCCTGTAGATCAAATC 

ACAGATCTTCCAAGGAACGATTTCCAGATGATGATTAAAATGCACAAGCTAACCTCAGAACAGTTA 

GAGTTTATTCATGATGTCCGACGGCGCAGCAAGAACCGCATCGCGGCCCAGCGCTGCCGCAAAAGG 

AAACTGGACTGTATTCAGAATTTAGAATGTGAAATCCGCAAATTGGTGTGTGAGAAAGAGAAACTG 

TTGTCAGAGAGGAATCAACTGAAAGCATGCATGGGGGAACTGTTGGACAACTTCTCCTGCCTTTCC 

CAGGAAGTTTGCCGAGACATCCAGAGCCCCGAGCAGATCCAGGCCCTGCATCGGTATTGCCCTGTC 

CTCAGACCCATGGACTTGCCCACGGCCTCCAGTATTAACCCTGCGCCCTTGGGTGCTGAGCAGAAC 

ATTGCGGCCTCCCAATGCGCAGTGGGGGAAAACGTGCCCTGCTGCTTGGAGCCAGGCGCGGCTCCC 

CCCGGACCCCCCTGGGCACCCAGCAACACCTCCGAGAATTGTACCTCTGGGAGGAGACTAGAAGGC 

ACTGACCCGGGAACCTTCTCAGAGAGAGGACCTCCTCTTGAACCCAGGAGCCAAACAGTGACCGTG 

GACTTCTGCCAGGAAATGACTGATAAGTGTACAACTGACGAACAGCCCAGGAAAGATTATACCTAG 

TGACTCGGCTCTGCCTCCCAGTCCGCACACCTCTCCCATCCAGGCGTTCTTCAGTCAGCCTGTGGC 

ACTGTTCATCTGCTGTCCCGAAGAAACCGAGAAO^CATTTGGTGCACACTACAGCGGTCTTAGCAG 

CAATACTGTTCCGAAGTATCCTCTCCTCTTCTCGAGCAGGAGTGATAGTTACCTTCACAATGGTGC 

TACCCCTTGCCCAGGCAAGGAAAGACAGCy^GTGATGACACTGTCTGTCTGTGGCTCAATTTCAGTC 

TTCACAGGGATAGACTACAACACCTCTAGGCCCCAACCACGGATTTTTTTTCTCAGTGGCCCATGT 

CACAAACCCTATCTCAGGAATTTCTTCTGAATGTTCAATTTTTTTCATTGAAGACAGCTTCTATAC 

ACATCAAAGTTTTATAGCTAGACTGTACATATTATATATAATATATATATAAAAAATATATATATA 

TATATATATCCM-ATGCAAAAGTCCTGCATGCCTCAACTTTCTCATCCTAAAACrGGAAACTTATT 

TCTCATTTAGAAACAGGTTCCAACATTCCTCTTCTTTTGTCTCTGATGCTAGAACT^^ 

CTGTTAACGTGGTCATTTTTCTTGCTTCACyVGTTCAATTTTCAATTCGTACTTATOT 

ATTCAGTGTOKSGAAGCTTTTTCCCAAGGlTrTATTTCAGATTTCTTTTTCGTT^ 

GCACCTCCAAGTGGTGTCATTTGAGCATTGTAGGTTTGTTTTTTGTTTGTO 

QTTTTTGTTTTTGTTTTTGTTTTCCTTGCy^GATACTGTACAGTAATGGTCa^ 

CTGAGTTTTGG6TCAAACCTATTTTCTTAAATGAAGTTGTAACTTCGGTATAACTCAAGTATACTC 

TATATTCTTTGCTTTTAGTTAAAAAAGTAAAAGATTTTAGCTAATTAAAAAGCACTCAGGTGATAA 

TTATGTAGGAAAAACAATCTTGCCAAATAATGAATTCATCCTAGGATGTGTAGACAATAATCTGCT 

TGAATATTTTTATATTTa^CCTCCTCCCCACCTTTCCCTAAGOiAAGTTTAAACGCAGATAGAGAG 

TTCAGAGTTGATGCTGGATGTTCAGATTCCTAAGTGGGGAGAGAGTTTGGACa^TCTCACTCAAAAG 

TACATCAGAAAAACAGGAATCCGTGATTTTATACCAGAACTCAGCAGGCATTGGCTCCTAGAAATC 

AAGTTA6AAAGTTTTCACCCAGGGAGTAAGTCCCATTCATTTCAACACGTCCTGAGGCCTCGGCTT 

GCTCTTGGAAGTGGTGTGCAGTAGGACCTGCTCCCCTGAAGGACGGGGCa^CCAGCa^CTGGCTT 

TCCTGCCCAGGCTTGGCCTCCCAGGACATCTGGCCTGAGGGGATTTGAATCACAGCCCC6AAGGTC 

CTGCCTTCACCCCATTGGGAGAGAGCAGGGCATCCTGGCATCTGCGATCCATCCCTGACACAGGCT 

GACACATTCTTTCTCCTTTCCTTCTCCAAAGGCTTGGAGTTTTCTTCTGAGGT^ 

TCTTGTCTGAAGGCAGACTTCATTCTGAGGCTTTGGAa^GCTATTCACCGGGAACCCTCCCT^ 

CCXTTCCCGAATO^CACACy^TACCCTACCCTCACCTGATGATAATTTTCTCTTCTTGC^ 

TGGTTGGCTTCCyVACCCAGAGAGAGCAGCTTCtXTTGGCTCTGGGGGCCGTGTTGGCrc 

GTTTACAGGAAGGTGTGCCCCAGAGGAGGAGGAATCAGCTCCCTax:TCCAGTGGCCrTGGGTCCG 

GGTCTCACOXSAGCAGCCCGAGGGCCACTCCAGCCCGGCTGGGGAAGAGAGTCCTGAACGGTTTGAT 

GTGGGGATXMGGTGGTGGGCAGTGGGGAATAGATGGTTGACTTTGTTTCTTTATTTGTC^ 

TTGGACAATATTAAAGCTGCATGTAAAAGGGGAAATTAGTATATGATGTAGGCTAAAAGTGAAATC 
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ATAGTAACATATGTTTTAGTATTATTAACTTTTTTCTGTACAAATATTAGCACTAA^ 
ATGTATGAATGCCAGAAATTTGTCAGTTCATGCAGTAGGATAAAAAAAAAAAAAAAAAAAAAAGGG 

CTTTTCTTOTTAAACAGTTCCACrrTTAAAACCTGCCTCTGGGTTTTTGTTTO^ 
GTGTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGAAACAGATCTTGATAAAGC 
5 TCTGTGTTGGAGCTGCTGGTTTTTGTTATGGTTGTTGGAATTTCTTCGCCTACTAGGAC^^^ 
TGCTTCACCATGAGGTTTGCCTTTGTGGAAAACTCGTGGGTGACAGTGAGAATATAAACTC^ 
GAATCACGTGATACTTCGGCAGGCGTGTGTTACAGTGGAGTCAGCTGACAGTATTTTGCTT^ 
CTCTATTGTTGCCTTTCCAAGTGACCTCTCCTCTTCTTTTAAAAAAAGAACACTTTCTGCTCAT^^ 
CATAACCAGGTCCAACCCAGCTTCTTGGCATGAGGTTTACCCTGGTAACAACTCATGTGCA^^ 

10 TAGTCTTGACCACATTCCATCCATTTCCTCAGGTTTCTGTGGTTCAGTAGCCCA^^^ 

AGCCATTTCTAGCAGGGGCGGGGCCTCTTTATTTCTCTCCACCCTAACTCAGACCTCACCOT^ 
CCACCCACCCCTGCCTTGCTTTTCTTCCTCTTCCCCCAACCTAACTTCTGCCATGG^ 
AAAACACTGCTCTAAAAACCATCTTCCAATTTCATAGAGATTTCTCACAAGTTAT^ 
TCCACCATGAACAGTGACTAGCTTCGTGCAGTTGTTCATCTGATGTGTGTGTGTCI^ 

15 AGAACTATGTGCTTGTCAAAATTATTTCTGGGTTGATTCAAAGGGAGGACTTGCTGGGGACCAGAA 

• tccaaacggcctcaagtggaaoyitraaaacctagcctgtctcttot 
ccccacgccttttaggaaaaagaaaagtgagtgaacagcaaggaagagtgtttgcacagta^ 
acatttggttgttcttaaggctcttttcttacaaaaataagagaccctcc^ 
ggaggatgcctgcttgggtctccaaatggctggggtaggaatggttgttggggcagagcc^^ 

20 ggtgagtgaccctgagactaatgaacatcccacctaaatccagtcctccccttggatctgcctttg 
tcctgcttgtgtatccaggcaacctctttvrcaagttcgtcaggctttggacag 
tgtatgtgtttgtttctctgcgttacctgggggtgccttgattaaaatcgaactto 
gattctggaacaaaacagttagaaaaactrtaaactttaaaaaaaaaaa^ 
ggccatcctgctatttatcttctgagttcccagcaatgactcaggcatcagagatgatgctgc^^^ 

25 ggaaaacctgactctgtgtgtctgcaactgaatgttgtgcgagtaatttattaactgtctttc 
aggtttgctgcttttaagatgcactataattcgggatgtaatccttacattgcttto 
ggaacaaaagtctagtgattagtatgccaactgccactactccttcaaaaggagccag^^ 
acaagactcatgagaggactggctaaagtgaagtgtgcacagtgtgaagtttaatgctgttgtca^ 
gaggcctaaaccca^cattttctcttttaatattttatgattgccatcaaagaagaagaa^ 

30 gaacagacaaaggtttgaaaatgataagcctcttaagacacraaaaactcctc^ 

cttgacatcctgtggagtagcataatcctctcaaaatgaggaagagctgcctgcaaagctto 
agtccctatttggctacctacttctctacattatgccccatttaaactaggagctgtcttaga^ 
gacttcaaactgcttcactattgcttacagtttaggaggagtctcagatccagaaggagcaagaat 
caagtttggtcctcaaatgactgtaaatagactaaagaacaaggtgttttttgtttt^ 

35 tttctaagaataaagctgttcgttgtatcatgagttagtgtttcttccccaaactc 
tggaagtgcaatttctggtgagtcagtccacaatacaatgccctgtc 
ggaaatctgtgtgcacgaggcattgtgtgttgaaagtgtatgtttatagtactgcc^^ 
catgaccccagcgtccaaaaccgatgctgtagaacagaacatatctgtcacaaataggtc^ 
aatagcatttgtacatagaaaagtctcy^ttgtggcagal^gct^taaattattca^^ 

40 caaaaacattacttgcaaagaaaagtttatagtattttcctacactccaccctgg^ 
tctatcaaatgaatatcagtgcattttaaatgtaatatgaaaacgatgctgcca 
tacccacttggttgcagaggccaactttcatagctttgatttaatgttgtc^ 
tttgctgtcaagcaatggataaacagctctgactttcattctcyvttccagtra^ 
aaaacactggcccttcttagaagcagaagtgtgcaccaagaccattcatttcaggtag 

45 tcagtgccaagtgctcccatgggaataatcagacgcatatgttgcgaaagagtgaagggacttgga 
caaagaggggttttcctacagatggatgctcagtcttctaccaaaacatgtttggaggc^ 
tgacctccccttaagtcctaacaatgtattttgtgtgtgcaaatcctgggatgcccgt^^ 
ctgacataaagacatggcacctctagtgagtgatcaggaagattccatatgc^ 
ggtgcttgttagacacagtgagccattcaaggcaagcaccacctttgctag 

50 ctgtgacaatttgacaatttgttccagaaccagtctgatgcaa 

caaactccagaggccatattcaaaacagggtcttctcagtgtatgcaaggggctgc^ 
ctcttcctccccaggttgaacaatacggacagttttcacacatatctacctgtataacc^^ 
cctctcataactggtcaacgactgtaacaggttacatcaggtgtttttctac^^ 
gattctatgcgattaatgtaatttaattcaatgcatcattttattgtactagto^ 

55 cttatttttttctaagtgattgtggtttttctcgtggtttttattgtaa;^^ 
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ATGCTTATTCTCTGTAACTAAGAATTTTACCTTTTGGGGGAAAAAAGCATTGCTAT^ 

AATTGGAACTTCATTTACTCATTGTAAATACACTAITGTGCAAAAAAAGTTTTC^ 

TGCTAGTGTTAACTGAATTTTGTCTAGACACCATTTCTGTTGATGAAATAAAGACATATC^ 

CATTGTAAACTG 

5 

Seg ID 26 

TTCTAAAGCTACTGGAGCACGGGACACGTGTCTGTACGCTCCTCTCACACCCCTTACTCGGCAGCT 
GCCCACTCCCGCCGTTGCCGTTGCCCTGAGTCTTCCCCACTGTTCTGCTGTGCCAGTGTGAGCATC 
TCCCCGGCACTCAAGTCCATGTTCTCTCCCTCCCGCTCTTCCTCCTGTCTCTTCCATGCTC 
10 TGCACGGGGACATCATGTGGGGCTTGATGCCGTACACAGTCCCTGGACCATGGGCCCCCTAA^ 

TTTCTTCCAGGAAGTGTTCACACTAGGACAGGGAGGTGGTTGGAGGTCACCAACTTCACCTTTTCC 
CTAGGAATGTGGGGGTTTTGTTTTTTTCTTCCCCCCATTCATTTGG 

Seq ID 27 

15 l^GTNALLffliENFIIXSKFLPCSSyiDSYDPSTGEVyCRVPNSGKDEI^ 
QERSRVLNQVADLLEQSLEEFAQAESKDQGKTLALARTMDIPRSVQNF^ 
HLGOyEHYTVRAPVGVAGLISPWNLPLYLLTWKIAPAMAAGls^ 

VPPGVVNIVFGTGPRVGEALVSHPEVPLISFTGSQPTAERITQLSAPHCKKLSLELGGKOT 
DANLDECIPATVRSSFANQGEICLCTSRIFVQKSIYSEFLKRFVEATRKWKVGIPSDPLVSIGALI 
20 SKAHLEKVRSYVKRAIiAEGAQIWCGEGVDKLSLPARNQAGYFMLPWITDIK^^ 
TCVVPFDSEEEVIERANNVKYGIJ^TWSSNVGRVro 
SGIGREGAKDSYDFFTEIKTITVKH 

Seq ID 28 

25 AATTCTAGGGACAACCTGAGTGCT(^GTCGTAAAGAGGAAAGGCAGAATTTTTCCTTGCTATG 
GGAACAAACGCACTTTTGATGCTGGAAAACTTCATAGATGGAAAATTTTTACCTTGTA^ 
ATAGATTCTTACGACCCATCAACAGGGGAAGTGTATTGCAGAGTGCCAAATAGTGGAAAAGAC^^ 
ATCGAAGCCGCGGTCAAGGCCGCCAGAGAAGCCTTTCCCAGCTGGTCATCCCGCAGCCCCCAGGAG 
CGCTCACGGGTCCTGAACCAGGTGGCGGATTTGCTGGAGCAGTCCCTGGAGGAGTTTGCCCAGGCC 

30 GAGTCTAAAGACCAAGGGAAAACCTTAGCACTGGCAAGAACCATGGACATTCCCCGGTCTGTGCAG 
AACTTCAGGTTCTTCGCTTCCTCCAGCCTGCACCACACGTCAGAGTGCACGCAGATGG 
GGCTGCATGCACTACACGGTGCGGGCCCCGGTGGGAGTCGCTGGTCTGATCAGCCCCTGGAATTO 
CCACTCTACTTGCTGACCTGGAAGATAGCTCCAGCGATGGCTGCAGGGAACACTGTGATAGCC^ 
CCCAGTGAGCTGACTTCAGTGACTGCGTGGATGTTGTGCAAACTCCTGGATAAAGCAGGTC^ 

35 CCAGGTGTGGTCAATATTGTGTTTGGAACCGGGCCCAGGGTGGGTGAGGCCCTGGTC^ 

GAGGTGCCCCTGATCTCCTTGACCGGGAGCCAGCCCACCGCTGAGCGGATCy^CCCAGCTGAG 
CCCCACTCCAAAAAGCTCTCCCTGGAGCTGGGGGGCAAGAATCCTGCCATCATCTTTGAG^ 
AACCTGGATGAGTGCATTCCGGCAACCGTCAGGTCCAGCTTTGCCAACGAG 
TGTACCAGCAGGATCTTTGTCCAGAAGAGCATCTATAGTGAAOTTTTAAAGAGATOT 

40 ACCAGAAAGTGGAAAGTCGGCATTCCCTCTGATCCACTGGTGAGCy^TAGGT^ 

GCACATTTGGAGAAAGTCAGAAGTTACGTCAAGAGAGCTCTTGCTGAAGGTGCCCAAA^^ 
GGTGAGGGAGTGGATAAGTTGAGCCTCCCTGCCAGGAACCAGGCAGGCTACTTTATCCTTCC 
GTGATAACy^GACATTAAGGATGAATCCTGCTGCATGACGGAAGAGATATTTGGTCCAGTC^ 
GTCGTCCCCTTTGATAGTGAAGAGGAGGTGATTGAAAGAGCCAACAACGTTAAGTATGGGCTGGCG 

45 GCTACCGTGTGGTCCAGCAATGTGGGGCGCGTCCACCGGGTGGCTAAGAAGCTGCAGTCTGGC^ 
GTCTGGACCAACTGCTGGCTCATCAGGGAGCTGAACCTTCCTTTCGGGGGGATGAAGAGTTC 
ATAGGTAGAGAGGGAGCCAAGGACTCTTACGACTTCTTCACTGAGATCAAAACCATCACCGTTAAA 
CACTGATCTTTGCTAATGGTGGAGCCACTATGGCCAATGCCTGGCTGCAGGCATC^^ 
GTGGTAGATGAAAATCATGGCATGAATTCCAGCTATGCCTTCACCTGGCAGAAGGTTATC^ 

50 TTATCCTCAGTTCTTAGTAACTTTACCCACTAGTGAAGAGATACTGTCTATTTTCAATC 
GGAAAAAAAGACTTATAAGTAGGAAGACAGAACAATGATGCCAGTTGTCAGGCTCCTC 
TGTTTTCATAGTGTTTCTTTCATCATCTTCATTGAACTCTT^ 
TTCATTTGGTAAATTTTAAAAAATATGCT^TCAGGC^ 

TTGGGAGGCCAAGGTGGGTGGATCACTTGAGTTCAGGAGTTCGAGATCAGCCTAG^ 
55 AAATCCTGTCTTTACCAAAAGTTTAAAAATTAGCTTGGTGTGGTGCCCT^ 
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TACTTGGGAGGCTGAGGTGGGAGGATCGCTTGAGCCCAGGCGGTTGAGGCTGCAGTGAGCCATGAT 
CATTCCACTGCATTTCAGCCTGGGGGATACAGTGAGACCTTGTCTTTAAAAAAAAAATC 
ACCAAAAATAAATAGCCATAATTTGCCATCTTTGGAGAGATTATGGAGATAATCTCCOT^ 
AGCCACTAGCATGGCTAATTGATCATTGATTGGACCTCATTACATAAGGGGGCTGTGGACGTC^ 
5 GGAATTTGTTAGGGGAAGGCTATGTGGTAAGCTCTACTAAATGTATCTGTACAAGGATGAAGAA^ 
ACAAAATAACTACTACTTTTGGAGGAAACTTGGAACAAGAAGAGTACATAGTTTGATCCATTCCTA 
CGGTCGAGTCTAATCAATTCCATTTCTTAGTGTATGCATCTAATGAGGCTTACAACTTGCTAACCC 
CAAAAGAAGCAATAAATGTTAAGCAATTTTAATAAAAAAAAAA 

10 Seq ID 30 

TCCAGGGTATACGTGCAGGATGTGCAGGTTTGTTACACAGGTAAACATGTGCTATGGTGGTTTGCT 
GTACCACACCCTGGTAATCTTTATTTTTTGTAGAGATGAGGTCTCACTATGTTGCACAGAGGGATC 
TCGAACTCCTGGGCTCAAGGGCTCACAGCCTCCCAAAGCACTGGGATTACCCCTGTGAGCA^ 
GCCTGGCCTTATTTTTGTACAATGTTGCTAGTAATCCCTACCTCAGCAGGCTGTGGTAAGGOT 
15 AAGGTACATTGTACATAAAGTACATTTTCTAGTACGTTTAATAAATGTTCAT^ 
AAATTTCTCAGAGGAAOTGCGAAGGAGTGGGACACT 

Seg ID 32 

GAGACTGCATAGGGCTCGGCGTGGTATOGATGCAAGGATAGGTATATATATCAATGGAAGA^ 
20 GAGAGTTCACT^TAAACTCTTACAATTATAGCCAATGATTTTCAACATAGGTGTCCAAACAATTC 
AATGGAAGAAAGGAGAGTCTCTTTTTTTCACATTCAGAATTTTTATTTTAAAAC;^ 

Seq ID 34 

GACNCTGCATAGGGTNNGGNGTGGGGAAC7UVATCTGG 
25 CCTCCTGGATTTAAAATTCAAAGGTATTATO^GAGAACAGTACAAATTTGAAAAGACCTGTAGAT 

TAAATAAGAGTATTGTTTCAAGGATAATTTTTTGATTGTO^ 

AATTACTTCCAAAGAATTCAGAAAATTGTGTGTGTGGGAGGCAGGGTAGCAAGATAAAAAGAGG^ 
GGACAGCTGGGGTTGGTAAAAACAAATGTAAAATGACAACATTTGGGAGAAACCTOT^ 
TTACTTTTGTGTCCAATTTGGCTAGGGCACAATACCANTATCCTATGTACTTTAA^ 
30 TAAGTCGAAGGGAGTAACCANTTAATTCCAATTTAAACACTTArrCCCGGGTC 

GTATTTTATACCATGTGGGTTANCATCCTATAGGNCAGCOTCACCTTTTTTAAGGG^ 

Seq ID 37 

MDPNCSCSTGGSCTCSSSCGCKNCKCTSCKKSCCSCCPVGCSKCAQGCVCKGASDKCTCCA 

35 

Seq ID 38 

TCCCGACTTCAGCAGCCCGACTGCCTTCTTGTCGCTTACACCGTTCCTCCAGATTCACCAGATCTC 
GGAATGGACCCCAACTGCTCCTGCTCCACCGGCGGCTCCTGCACCTGCTCCAGCTCCTGC^ 
AAGAACTGCAAATGCACCTCCTGCAAGAAGAGCTGCTGCTCCTGCTGCCCCGTGGGCT^ 
40 TGTGCCCAGGGCTGTGTCTGCAAAGGTGCCTCGGACAAGTGCACGTGCTGTGCCTGAAGTGAC^ 
CAGTGCTGCTGCCCTCAGGTGTAAATAATTTCCGGACCAACTCAGAGTCTTGCCGTACACCTCC^^ 
CCAGTTTACTAAACCCCGTTTTCTACCGAGCATGTGAATAATAAAAGCCTGTTTATTCT 

SEQIDNO:39 
45 Nucleotide sequence of ires-GFP DN A fragment 

CTAGAGTGTGATTTTAAGGGCGAATTCTGCAGATATCCATCACACTGGCGGCCGCACTAGAG^ 
TCGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGGTGTGT 
GTTTGTCTATATGTGATTTTCCACCATATTGCCGTCTTTTGG(^ 
50 CCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAT^ 

GAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAG^ 
TTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTCCCTCTGCGGCCAAAAGCCACGTC^ 

TACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTG^ 
ATGGCTCTCCTCAAGCGTAGTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCAT^ 
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AATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAAG^ 
CCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATGATACCATGGTG^^ 
AGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGT 
TCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCA 
5 CCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCT 
TCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACG 
TCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCG 
AGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCC 
TGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGA 
10 ACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACC 
ACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCA 
CCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGA 
CCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGACT 

15 SEQIDNO:40 

Nucleotide sequence of DNA fragment containing human HIF-la protein coding sequence 
CTAGCCGTAGAATCCGACCGATTCACCATGGAGGGCGCCGGCGGCGCGAACGACAAGAAAAAGATA 

agttctgaacgtcgaaaagaaaagtctcgagatgcagccagatctcggcgaagtaaagaatctgaa 

20 gttt.tttatgagcttgctcatcagttgccacttccacataatgt6ag 

tctgtgatgaggcttaccatcagctatttgcgtgtgaggaaacttctggatgctot 
attgaagatgacatgaaagcacagatgaattgcttttatttgaaagccttggatggttttg 
gttctcacagatgatggtgacatgatttacatttctgataatgtgaacaaatacatg^ 
cagtttgaactaactggacacagtgtgtttgattttactcatgcatgtgaccatgaggaaao^ 

25 gaaatgcttacacacagaaatggccttgtgaaaaagggtaaagaacaaaacacacagccs^ 
tttctcagaatgaagtgtaccctaactagccgaggaagaaotatgaacataaagtctgc^ 
aaggtattgcactgcacaggccacattcacgtatatgataccaacagtaaccaacctct^^ 
tataagaaaccticctatgacctgcttggtgctgatttgtgaacccattcctcacccatca^ 
gaaattcctttagatagcaagactttcctcagtcgacacagcctggatatgaaattttcttattgt 

30 gatgaaagaattaccgaattgatgggatatgagccagaagaacttttaggccgctcaattt^ 
tattatcatgctttggactctgatcatctgaccaaaactcatcatgatatgtttactaaag 
gtcaccacaggacagtacaggatgcttgccaaaagaggtggatatgtctgggttgaaactcaag 
actgtcatatataacaccaagaattctcaaccacagtgcattgtatgtgtc 
ggtattattcagct^cgacttgattttctcccttcaacaaacagaatgtgtccttaaaccggto^ 

35 tcttcagatatgaaaatgactcagctattcaccaaagttgaatcagaagataca^ 

gacaaacttaagaaggaacctgatgctttaactttgctggccccagccgctggagacacaai^ 
tctttagattttggcagcaacgacacagaaactgatgaccaga^cttgaggaagt^ 
aatgatgtaatgctcccctcacccaacgaaaaattacagaatataaatttggc^ 
cccaccgctgaaacgccaaagccacttcgaagtagtgctgaccctgcactcaatcaagaagttgca 

40 ttaaaattagaaccaaatccagagtcactggaactttcttttaccatgccccagattcag^ 
acacctagtccttccgatggaagcactagacaaagttcacctgagcctaatagtcccagtg;^^ 
tgtttttatgtggatagtgatatggtcaatgaattcaagttggaattggtagaaaaacot 

GAAGACACAGAAGCAAAGAACCCATTTTCTACTCAGGACACAGATTTAGACTTGG^ 
CCCTATATCCCAATGGATGATGACTTCCAGTTACGTTCCTTCGATCAGTTGTCACCyiTO 

45 AGTTCCGCAAGCCCTCAAAGCGCAAGTCCTCAAAGCACAGTTACAGTATTCCAGC^^ 
CAAGAACCTACTGCTAATGCCACCACTACCACTGCCACCACTGATGAATTAAAAAC^^ 
GACCGTATGGAAGACATTAAAATATTGATTGCATCTCCATCTCCTACCCACATACATAAAGAAA^ 
ACTAGTGCCACATCATCACCATATAGAGATACTCAAAGTCGGACAGCCTCACCAAACAGAG^ 
AAAGGAGTCATAGAACAGACAGAAAAATCTCATCCAAGAAGCCCTAACGTGTTATCTGTC^ 

50 AGTCAAAGAACTACAGTTCCTGAGGAAGAACTAAATCCAAAGATACTAGCTT^ 
AGAAAGCGAAAAATGGAACATGATGGTTCACTTTTTCAAGCAGTAGGAATTC^ 
CAGCCAGACGATCATGCAGCTACTACATCACTTTCTTGGAAACGTGTAAAAGGATG^^ 
GAACAGAATGGAATGGAGCAAAAGACAATTATTTTAATACCCTCTGATTTAGC^^ 
GGGCAATCAATGGATGAAAGTGGATTACCACAGCTGACCAGTTATGATTCT^ 
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ATACAAGGCAGCAGAAACOTACTGCAGGGTGAAGAATTACTCAGAGCTTTGGATCAAGTTAAC 
GCGGATCCGACGGGGATCCT 

SEQIDN0:41 

5 

Nucleotide sequence of DNA fragment containing human E PASlorotein coding sequence 

AGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCCAGCGACAATGACAGCTGACAAGGAGAAGAAA 
AGGAGTAGCTCGGAGAGGAGGAAGGAGAAGTCCCGGGATGCTGCGCGGTGCCGGCGGAGCAAGGAG 

10 ACGGAGGTGTTCTATGAGCTGGCCCATGAGCTGCCTCTGCCCCACAGTGTGAGCTCCCATCTG^ 
AAGGCCTCCATCATGCGACTGGAAATCAGCTTCCTGCGAACACACAAGCTCCTCTCCTCAGTTTGC 
TCTGAAAACGAGTCCGAAGCCGAAGCTGACCAGCAGATGGACAACTTGTACCTGAAAGCCTTGGAG 
GGTTTCATTGCCGTGGTGACCCAAGATGGCGACATGATCTTTCTGTCAGAAAACATCAGCAAGOT^ 
ATGGGACTTACACAGGTGGAGCTAACAGGACATAGTATCTTTGACTiaiCTCA 

15 GAGGAGATTCGTGAGAACCTGAGTCTCAAAAATGGCTCTGGTTTTGGGAAAAAAAGCAAAGACATG 
TCCACAGAGCGGGACTTCTTCATGAGGATGAAGTGCACGGTCACCAACAGAGGCCGTACT^ 
CTCAAGTCAGCCACCTGGAAGGTCTTGCACTGCACGGGCCAGGTGAAAGTCTACAACAACTGCCCT 
CCTCACAATAGTCTGTGTGGCTACAAGGAGCCCCTGCTGTCCTGCCTCATCATCATGTGTG^^ 
ATCCAGCACCCATCCCACATGGACATCCCCCTGGATAGCAAGACCTTCCTGAGCCGCCACAGCATC 

20 GACATGAAGTTCACCTACTGTGATGACAGAATCACAGAACTGATTGGTTACCACCCTGAGGAGCTG 
CTTGGCCGCTCAGCCTATGAATTCTACCATGCGCTAGACTCCGAGAACATGACCAAGAGTCAC^^ 
AACTTGTGCACCAAGGGTCAGGTAGTAAGTGGCCAGTACCGGATGCTCGCAAAGCATGGGGGCTA^ 
GTGTGGCTGGAGACCCAGGGGACGGTCATCTACAACCCTCGCAACCTGCAGCCCCAGTGCATCATG 
TGTGTCAACTACGTCCTGAGTGAGATTGAGAAGAATGACGTGGTGTTOTCCATGGACCA^^ 

25 TCCCTGTTCAAGCCCCACCTGATGGCCATGAACAGCATCTTTGATAGCAGTGGCAAGGGGGCTC 
TCTGAGAAGAGTAACTTCCTATTCACCAAGCTAAAGGAGGAGCCCGAGGAGCTGGCCCAGCTGGCT 
CCCACCCCAGGAGACGCCATCATCTCTCTGGATTTCGGGAATCAGAACTTCGAGGAGTCCTCAGCC 
TATGGCAAGGCO^TCCTGCCCCCGAGCCAGCCATGGGCCACGGAGTTGAGGAGCCACAGCACCC^^ 
AGCGAGGCTGGGAGCCTGCCTGCCTTCACCGTGCCCCAGGCAGCTGCCCCGGGCAGCACCACCCCC 

30 AGTGCCACCAGCAGCAGCAGCAGCTGCTCCACGCCCAATAGCCCTGAAGACTATTACACATCOTTG 
GATAACGACCTGAAGATTGAAGTGATTGAGAAGCTCTTCGCCATGGACACAGAGGCCAAGGACCAA 
TGCAGTACCCAGACGGATTTCAATGAGCTGGACTTGGAGACACTGGC7VCCCTATATCCCCA 
GGGGAAGACTTCCAGCTAAGCCCCATCTGCCCCGAGGAGCGGCTCTTGGCGGAGAACCCACAG 
ACCCCCCAGCACTGCTTCAGTGCCATGACAAACATCTTCCAGCCACTGGCCCCTGTAGCCCCGCAC 

35 AGTCCCTTCCTCCTGGACAAGTTTCAGCAGCAGCTGGAGAGC?^GAAGACAGAGCCCGAGC^^ 
CCCATGTCCTCCATCTTCTTTGATGCCGGAAGCAAAGCATCCCTGCCACCGTGCTGTGGCCA^^ 
AGCACCCCTCTCTCTTCCATGGGGGGCAGATCCAATACCCAGTGGCCCCCAGATCCACCATTACAT 
TTTGGGCCCACAAAGTGGGCCGTCGGGGATCAGCGCACAGAGTTCTTGGGAGCAGCGCCGTTGGGG 
CCCCCTGTCTCTCCACCCCATGTCTCCACCTTCAAGACAAGGTCTGCAAAGGGTTTTGGGG 

40 GGCCCAGACGTGCTGAGTCCGGCCATGGTAGCCCTCTCCAACT^GCTGAAGCTG^^ 

GAGTATGAAGAGCAAGCCTTCCAGGACCTGAGCGGGGGGGACCCACCTGGTGGCAGCACCTCACAT 
TTGATGTGGAAACGGATGAAGAACCTCAGGGGTGGGAGCTGCCCTTTGATGCCGGACAAGCCACTG 
AGCGCAAATGTACCCAATCATAAGTTCACCCAAAACCCCATCAGGGGCCTGGGCCATC 
CATCTGCCGCTGCCACAGCCTCCATCTGCCATCAGTCCCGGGGAGAACAGCAAGAGCAGGTTCCCC 

45 CCACAGTGCTACGCCACCCAGTACCAGGACTACAGCCTGTCGTCAGCCCACAAGGTGTCAGGC^ 
GCAAGCCGGCTGCTCGGGCCCTCATTTGAGTCCTACCTGCTGCCCGAACTGACCAG^^ 
GAGGTGAACGTGCCCGTGCTGGGAAGCTCCACGCTCCTGCAAGGAGGGGACCTCCTCAGAGCCCTG 
GACCAGGCCACCTGAGCCAGGCCTTCTACCTGGGCAGCACCTCTGCCCACGCCGAGCCCTATGCAG 
TCTCGGCCGCAAGCTATCAGATCTGCCGGTCTCCCTATAGTGAGTCGTATTAATTTCGATAAGCCA 

50 GGTT 
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SEQIDNO:42 

The nucleotide seauencft of pSMART CMV-HIF 

agatcttgaataataaaatgtgtctttgtccgaaatacgcgttttgagatttctc^ 

5 attcatgtcgcgcgatagtggtgtttatcgccgatagagatggcgatattggaaaaattgatatot 
gaaaatatggcatattgaaaatgtcgccgatgtgagtttctgtgtaactgata^ 
aaaagtgatttttgggcatacgcgatatctggcgatagcgcttatatcgtttacgggggatggcga 
tagacgactttggtgacttgggcgattctgtgtgtcgcaaatatcgcagtttcgatataggtgaca 
gacgatatgaggctatatcgccgatagaggcgacatcaagctggcacatggcc^ 

10 ctatacattgaatcaatattggccattagccatattattcattggttatatagcataaa^ 
tggctattggccattgcatacgttgtatccatatcgtaatatgtacatttatattggct^ 
aacattaccgccatgttgacattgattattgactagttattaatagtaatcaatta 
agttcatagcccatatatggagttccgcgttacataacttacggtaaatggcccxscct^ 
gcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagt;^^ 

15 tttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatc^ 
tcatatgccaagtccgccccctattgacgtcaatgacggtaaatggcccgcctggcattat^ 
gtacatgaccttacgggactttcctacttggcagtacatctacgtattagtcatcgctattacc^^ 
ggtgatgcggttttggcagtacaccaatgggcgtggatagcggtt'^ 
tctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgg^ 

20 tcgtaacaactgcgatcgcccgccccgttgacgcaaatgggcggtaggcgtgtacggt^ 
tatataagcagagctcgtttagtgaaccgggcactcagattctgcggtctgagtcccra^ 

GGGCTGAAAAGGCCTTTGTAATAAATATAATTCTCTACTCAGTCCCTGTCTCTAGTTTC^^ 

gagatcctacagttggcgcccgaacagggacctgagaggggcgcagaccctacct^ 
ctgatcgtaggatccccgggacagcagaggagaacttacagaagtcttctggaggtgttcctggcc 

25 agaacacaggaggacaggtaagattgggagaccctttgacattggagcaaggcgctcaagaagtta 
gagaaggtgacggtacaagggtctcagaaattaactactggtaactgtaattgggcgct^ 
gtagacttatttcatgataccaactttgtaaaagaaaaggactggcagctgagggat^ 
ttgctggaagatgtaactcagacgctgtcaggacaagaaagagaggcctttgaaag;^ 
gcaatttctgctgtaaagatgggcctccagattaataatgtagtagatggaaaggc^ 

30 ctcctaagagcgaaatatgaaaagaagactgctaataaaaagcagtctgagccctctgaagaatat 
ctctagagtcgacgctctcattacttgtaacaaagggagggaaagtatgggaggaca 
ggaagtatttatcactaatcaagcac?vagtaatacatgagaaacttot 
ctccaaaaaattttgtttttacaaaatccctggtgaacatggt^ 
ccgggctgcaggagtggggaggcacgatggccgctttggtcgaggcggatccggccatta^ 

35 ttattcattggotatatagcataaatcaatattggctattggc^^ 
cataatatgtacatttatattggctcatgtccaacattaccgccatgti^ 
agttattaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgcgttaca 
taacrracggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataa 
acgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtc^ 

40 taaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgc^^ 

gacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggc^^ 

tacatctacgtattagtcatcgctattaccatggtgatgcggttttggcag 
ggatagcggtttgactcacggggatttccaagtctccaccccattgacg 
tggcaccaaaatcaacgggactotccaaaatctcgtaacaactccgccccat^^ 
45 ggtaggcatgtacggtgggaggtctatataagcagagctcgtttagtgaacostcagatcgcct^ 
agacgccatccacgctgttttgacotccatagaagacaccgggaccgatcc^ 
gaacggtgcattggaagcttggtaccggctagccgtagaatccgaccgattcaccat^ 

cggcggcgcgaacgacaagaaaaagataagtoeraaacgtcgaaa^ 
cagatctcggcgaagtaaagaatctgaagttttttatgagertgctc^ 
50 taatgtgagttcgcatctogataaggcctctgtgatgaggcttaccat^ 
gaaacttctggatgctggtgatttggatattgaagatgacatgaaagca 
tttgaaagccttggatggttttgttatggttctcacagatgatggtgacat 
taatgtgaacaaatao^tgggattaactcagtttgaactaactggac^ 
tcatccatgtgaccatgaggaaatgagagaaatgcttacacacagaaatggcc 
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TAAAGAACAAAACACACAGCGAAGCTTTTTTCTCAGAATGAAGTGTACCCTAACTAGCCGAGGAAG 
AACTATGAACATAAAGTCTGCAACATGGAAGGTATTGCACTGCACAGGCCACATTCACGTATATGA 
TACCAACAGTAACCAACCTCAGTGTGGGTATAAGAAACCACCTATGACCTGCTTGGTGCTGAT^^ 
TGAACCCATTCCTCACCCATCAAATATTGAAATTCCTTTAGATAGCAAGACTTTCCTCAGTCGACA 
5 CAGCCTGGATATGAAATTTTCTTATTGTGATGAAAGAATTACCGAATTGATGGGATATGAGCCAGA 
AGAACTTTTAGGCCGCTCAATTTATGAATATTATCATGCTTTGGACTCTGATCATCTGACCAAAAC 
TCATCATGATATGTTTACTAAAGGACAAGTCACCACAGGACAGTACAGGATGCTTGCCAAAAGAGG 
TGGATATGTCTGGGTTGAAACTCAAGCAACTGTCATATATAACACCAAGAATTCTCAACCACAGTG 
CATTGTATGTGTGAATTACGTTGTGAGTGGTATTATTCAGCACGACTTGATTTTCTCCCTTCAA 

10 AACAGAATGTGTCCTTAAACCGGTTGAATCTTCAGATATGAAAATGACTCAGCTATTCACCAAAGT 
TGAATCAGAAGATACAAGTAGCCTCTTTGACT^CTTAAGAAGGAACCTGATGC^^^ 
GGCCCCAGCCGCTGGAGACACAATCATATCOTTAGATTTTGGCAGCAACGACACAGAAACTGATGA 
CCAGCAACTTGAGGAAGTACCATTATATAATGATGTAATGCTCCCCTCACCCAACGAAAAATTACA 
GAATATAAATTTGGCAATGTCTCCATTACCCACCGCTGAAACGCCAAAGCCACTTCG^ 

15 TGACCCTGCACTCAATCAAGAAGTTGCATTAAAAOTAGAACCAAATCCAGAGTCACTG^ 

TTTTACCATGCCCCAGATTCAGGATCAGACACCTAGTCCOTCCGATGGAAGCACTAGACA^ 
ACCTGAGCCTAATAGTCCCAGTGAATATTGTTTTTATGTGGATAGTGATATGGTCAATGAATTCAA 
GTTGGAATTGGTAGAAAAACTTTTTGCTGAAGACACAGAAGCAAAGAACCCATTTTCTACT^^ 
CACAGATTTAGACTTGGAGATGTTAGCTCCCTATATCCCAATGGATGATGACTTCCAGTTACGTTC 

20 CTTCGATCL?^GTTGTCACCATTAGAAAGCAGTTCCGCyU^GCCCTGAAAGCGCAAG 

AGTTACAGTATTCCAGCAGACTCAAATACAAGAACCTACTGCTAATGCCACCACTACC^^ 
CACTGATGAATTAAAAACAGTGACAAAAGACCGTATGGAAGACATTAAAATATTGATT^ 
ATCTCCTACCCACATACATAAAGAAACTACTAGTGCCACATCATCACCATATAGAGATACTCAAAG 
TCGGACAGCCTCACCTU^CAGAGCAGGAAAAGGAGTCATAGAACAGACAGAAAAATCTCATC 

25 AAGCCCTAACGTGTTATCTGTCGCTTTGAGTCAAAGAACTACAGTTCCTGAGGAAGAACTAAATCC 
AAAGATACTAGCTTTGCAGAATGCTCAGAGAAAGCGAAAAATGGAACATGATGGTTCAC 
AGCAGTAGGAATTGGAACATTATTACAGCAGCCAGACGATCATGCAGCTACTACATCACTTO 
GAAACGTGTAAAAGGATGCAAATCTAGTGAACAGAATGGAATGGAGCAAAAGACAATTATOT 
ACCCTCTGATTTAGCATGTAGACTGCTGGGGCAATCAATGGATGAAAGTGGATTACCAC^^ 

30 CAGTTATGATTGTGAAGOTAATGCTCGTATACAAGGCAGCAGAAACCTACTGCAGG^^ 

ACTCAGAGCTTTGGATCAAGTTAACTGAGCGGATCCGACGGGGATCCTCTAGCGTTATCCATCACA 
CTGGCGGCCGCGACTCTAGAGTCGACCTCGAGGGGGGGCCCGGACCTACTAGGGTGCTGTGGAAGG 
GTGATGGTGCAGTAGTAGTTAATGATGAAGGAAAGGGAATAATTGCTGTACCATTAACCAGGACT^ 
AGTTACTAATAAAACCAAATTGAGTATTGTTGCAGGAAGCAAGACCCAACTACCATTGTCAGCTC 

35 GTTTCCTGACCTCAATATTTGTTATAAGGTTTGATATGAATCCCAGGGGGAATCTCAACC^^ 
ACCCAACAGTCAGAAAAATCTAAGTGTGAGGAGAACACAATGTTTCAACCTTATTGTTATAAT 
GACAGTAAGAACAGCATGGCAGAATCGAAGGAAGCAAGAGACCAAGAATGAACCTGAAAGAAGAAT 
CTAAAGAAGAAAAAAGAAGAAATGACTGGTGGAAAATAGGTATGTTTCTGTTATGCTTAGCAGGA^ 
CTACTGGAGGAATACTTTGGTGGTATGAAGGACTCCCACAGCAACATTATATAGGGTTGGTC^ 

40 TAGGGGGAAGATTAAACGGATCTGGCCAATCAAATGCTATAGAATGCTGGGGTTCCTTC 
GTAGACCATTTCAAAATTACTTCAGTTATGAGACCAATAGAAGCATGCATATGGATAATA^ 
CTACATTATTAGAAGCTTTAACCAATATAACTGCTCTATAAATAACAAAACAGAATTAGAAACATG 
GAAGTTAGTAAAGACTTCaXSGCATAACTCCTTTACCTATTTCTTCTGAAGCTi^ 
TAGACATAAGAGAGATTTTGGTATAAGTGCAATAGTGGCAGCTATTGTAGCCGCTACTGCTATTC 

45 TGCTAGCGCTACTATGTCTTATGTTGCTCTAACTGAGGTTAACAAAATAATGGAAGTAC^^ 
TACTTTTGAGGTAGAAAATAGTACTCTAAATGGTATGGATTTAATAGAACGACAAATAAAGJ^ 
ATATGCTATGATTCTTCAAACACyiiTGCAGATGTTCAACTGTTAAAGGAAAGAC^ 
GACATTTAATTTAATTGGATGTATAGAAAGAACACATGTATTTTGTCATACTGGTCATCCCTGGAA 
TATGTCATGGGGACATTTAAATGAGTCAACACAATGGGATGACTGGGTAAGCAAAATGGAAGATT^ 

50 AAATCAAGAGATACTAACTACACTTCATGGAGCCAGGAAO^TT^^ 
CAATACACCAGATAGTATAGCTCAATTTGGAAAAGACCTTTGGAGT^ 
TGGATTGGGAGCTTCCATTATAAAATATATAGTGATGTTTTTGCTTATTTATTTGTO 
TTCGCCTAAGATCCTCAGGGCCCTCTGGAAGGTGACCAGTGGTGCAGGGTCCTCCGGC!VGTCG^ 
CCTGAAGAAAAAATTCCATCACAAACATGCATCGCGAGAAGACACCTGGGACCAGGCC 

55 CATACACCTAGCAGGCGTGACCGGTGGATCAGGGGACAAATACTACAAGCAGAAGTACTCC^^ 
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CGACTGGAATGGAGAATCAGAGGAGTACAACAGGCGGCCAAAGAGCTGGGTG^GTCAATCGAGGC 
ATTTGGAGAGAGC'TATATTTCCGAGAAGACCAAAGGGGAGATTTCTCAGCCTGGGGCGGCTATCAA 
CGAGCACAAGAACGGCTCTGGGGGGAACAATCCTCACCAAGGGTCCTTAGACCTGGAGATTCGAAG 
CGAAGGAGGAAACATTTATGACTGTTGCATTAAAGCCCAAGAAGGAACTCTCGCTATCCCTTGCTC 
5 TGGATTTCCCTTATGGCTATTTTGGGGACTAGTAATTATAGTAGGACGCATAGCAGGCTATGGATT 
ACGTGGACTCGCTGTTATAATAAGGATTTGTATTAGAGGCTTAAATTTGATATTTGAAATAAT 
AAAAATGCTTGATTATATTGGAAGAGCTTTAAATCCTGGCACATCTCATGTATCAATGCCTCAGTA 
TGTTTAGAAAAACAAGGGGGGAACTGTGGGGTTTTTATGAGGGGTTTTATAAATGATTAT^^ 
AAAAAGAAAGTTGCTGATGCTCTCATAACCTTGTATAACCCAAAGGACTAGCTCATGTTGCTAGGC 

10 AACTAAACCGCAATAACCGCATTTCTGACGCGAGTTCCCCATTGGTGACGCGTTAACTTCCT^ 
TTACAGTATATAAGTGCTTGTATTCTGACAATTGGGCACTCAGATTCTC^ 
CTGCTGGGCTGAAAAGGCCTTTGTAATAAATATAATTCTCTACTCAGTCCCTGTCTCT^^ 
TGTTCGAGATCCTACAGAGCTCATGCCTTGGCGTAATCATGGTCATAGCTGTTTCCT^^ 
TGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGG^ 

15 TAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTG 
TCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTC 
TCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCAC 
TCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACA^ 
GGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCC 

20 CCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGA 
TACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGA 
TACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTC 
AGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC 
TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCA 

25 GCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGG 
TGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACC 
TTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCT^CAAACCACCGCTGGTAGC^ 

gtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatc^ 
gggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaagg 

30 atcttcacctagatcctti^aaattaaaaatgaagttttaaatc^ 

acttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatct^ 
tcatccatagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctggc 
cccagtgctgcaatgataccgcgagacccacgctcaccggctccagatttatcagcaataaaccag 
ccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctattaat 

35 tgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgtt^ 
acaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagc^^ 
aggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgtt 
gtcagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataattctc^ 
gtcatgccatccgtaagatgcttttctgtgactggtgagtactca^ 

40 tgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcaga 
actttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctg 
ttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagcatct^ 
agcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataagg^ 
t^tgttgaatactcatactcttcctttttcaatattattgaagcatttatcagggtt^ 

45 atgagcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgcac^^ 
cgaaaagtgccacctaaattgtaagcgttaatattttgttaaaattcgcgttaaattt™ 

TCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACC^ 
AGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGA^^ 
TCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTJ^ 
50 TTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTT 
GACGGGGAAAGCCAACCTGGCTTATCGAAATTAATACGACrCACTATAGGGAGACCGGC 
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SEQIDNO:43 

The nnclentide seQuencft nf pSMART CMV -emptv 

5 AGATCTTGAATAATAAAATGTGTGTTTGTCCGAAATACGCGTTTTGAGATTTCTGTCGCCGACTAA 
ATTCATGTCGCGCGATAGTGGTGTTTATCGCCGATAGAGATGGCGATATTGGAAAAATTGATAT^ 
GAAAATATGGCATATTGAAAATGTCGCCGATGTGAGTTTCTGTGTAACTGATATC^ 
AAAAGTGATTTTTOGGCATACGCGATATCTGGCGATAGCGCTTATATCGTTTACGGGGGATGGCGA 
TAGACGACTTTGGTGACTTGGGCGATTGTGTGTGTCGCAAATATCGCAGTTTCGAT^^^ 
10 GACGATATGAGGCTATATCGCCGATAGAGGCGACATCAAGCTGGCACATGGCCAATGCATATC^^ 
CTATACATTGAATCAATATTGGCCATTAGCCATATTATTCATTGGTTATATAGCATAAA 
TGGCTATTGGCCATTGCATACGTTGTATCCTVTATCGTAATATGTACATTTATATTGGCTC^ 
AACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAAOT^ 
AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTA7ATGGCCCGCCTGG " 
15 GCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAAT^^ 
TTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTAC^^ 
TCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCC^ 
GTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC^ 
GGTGATGCGGTTTTGGCAGTACACCAATGGGCGTGGATAGCGGTTTGACTCACG 
20 TCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACOT^ 
TCGTAACAACTGCGATCGCCCGCCCCGTTGACGCAAATGGGCGGTAGGCGTGT^^ 
TATATAAGCAGAGCTCGTTTAGTGAACCGGGCACTCAGATTCTGCGGTCTGAGTCCCTTCTCTC 
GGGCTGAAAAGGCCTTTGTAATAAATATAATTCTCTACTCAGTCCCTGTCTCTAGTT^^^ 
GAGATCCTACAGTTGGCGCCCGAACAGGGACCTGAGAGGGGCGCAGACCCTACCTGTTGAACCT^ 
25 CTGATCGTAGGATCCCCGGGACyiGCAGAGGAGAACTTACAGAAGTCTTCTGGAGGTGTTCCTGGCC 
AGAACACAGGAGGACAGGTAAGATTGGGAGACCCOTTGACATTGGAGCAAGGCTC^ 
GAGAAGGTGACGGTACAAGGGTCTCAGAAATTAACTACTGGTAACTGTAATTGGGCGCTA^^ 
GTAGACTTATTTCTVTGATACCAACTTTGTAAAAGAAAAGGACTGGCAGCTC 
TTGCTGGAAGATGTAACTCAGACGCTGTCAGGACAAGAAAGAGAGGCCTTTGAAAGAACATGGTGG 
30 GCAATTTCTGCTGTAAAGATCGGCCTCCAGATTAATAATGTAGTAGATGG^ 

CTCCTAAGAGCGAAATATGAAAAGAAGACTGCTAATAAAAAGCAGTCTGAGCCCTCTGAAGAATAT 
CTCTAGAGTCGACGCTCTCATTACTTGTAACAAAGGGAGGGAAAGTATGGGAGGACAGACACCATG 
GGAAGTATTTATCACTAATCAAGCACAAGTAATACATGAGAAACTTTTACTACAGCA^^ 
CTCCAAAAAATTTTGTTTTTACAAAATCCCTGGTGAACATGGTCGACTCT^^ 
35 CCGGGCTGCAGGAGTGGGGAGGCACGATGGCCGCTTTGGTCGAGGCGGATCCGGCCATTAGCCATA 
TTATTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTC 
CATAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCCATGTTGACA 
AGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGT^ 
TAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATG 
40 ACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGT^ 
TAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTAOT^ 
GACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTC 
TACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGT 
GGATAGCGGTTTGACTCACGGGGATTTCCAAGTOTCCACCCCIATTGACGTCAATC 
45 TGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTC^^ 

GGTAGGCATGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGG 
CCGCGACTCTAGAGTCGACCTCGAGGGGGGGCCCGGACCTACTAGGGTGCTGTGGAAGGGTGATGG 
TGCAGTAGTAGTTAATGATGAAGGAAAGGGAATAATTGCTCTACCATTAACCAGGACTAAGTTACT 
AATAAAACCAAATTGAGTATTGTTGCAGGAAGCAAGACCCAACTACCATTGTCAGCTGTGTTO^ 
50 GACCTCAATATTTCTTATAAGGTTTGATATGAATCCCAGGGGGAATCTCAACCCCTATT^^ 
AGTCAGAAAAATCTAAGTGTGAGGAGAACACAATGTTTCAACCTT^ 
AGAACAGCATGGCAGAATCGAAGGAAGCAAGAGACCAAGAATGAACCTGAAAGAAGAATC 
AGAAAAAAGAAGAAATGACTGGTGGAAAATAGGTATGTTTCTGTTATGCTTAGC^ 
AGGAATACTTTGGTGGTATGAAGGACTCCCACAGCAACATTATATAGGGTTGGTGGCGAT^ 
55 AAGATTAAACGGATCTGGCCAATCAAATGCTATAGAATGCTGGGGTTCCTTC 
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ATTTCAAAATTACTTCAGTTATGAGACCAATAGAAGCATGCATATGGATAAT^^ 
ATTAGAAGCTTTAACCAATATAACTGCTCTATAAATAACAAAACAGAATTAGAAACATGGAA^ 
GTAAAGACTTCTGGCATAACTCCTTTACCTATTTCTTCTGAAGCTAACACTGGACTAAT^^ 
AAGAGAGATTTTGGTATAAGTGCAATAGTGGCAGCTATTGTAGCCGCTACTGCTATTGCTGCTAGC 
5 GCTACTATGTCTTATGTTGCTCTAACTGAGGTTAACAAAATAATGGAAGTACAAAATCATACTT^ 
GAGGTAGAAAATAGTACTCTAAATGGTATGGATTTAATAGAACGACAAATAAAGATATO^ 

atgattcttcaaacacatgcagatgttcaactgttaaaggaaagacaacaggtagaggagacatot 
aatttaattggatgtatagaaagaacacatgtattttgtcatactggtcatccctggaatatg^^ 
tggggacatttaaatgagtcaacacaatgggatgactgggtaagcaaaatggaagatttaaatc^ 

10 gagatactaactacacttcatggagccaggaacaatttggcacaatccatgataacattc^ 
ccagatagtatagctcaatttggaaaagacctttggagtcatattggaa^ 
ggagcttccattataaaatatatagtgatgtttttgcttatttatttgttactaacctc^ 
aagatcctcagggccctctggaaggtgaccagtggtgcagggtcctccggcagtcgttacctgaag 
aaaaaattccatcacaaacatgcatcgcgagaagacacctgggaccaggcccaacac^ 

15 ctagcaggcgtgaccggtggatcaggggacaaatactacaagcagaagtactccaggaacga 
aatggagaatcagaggagtacaacaggcggccaaagagctgggtgaagtcaatcgaggcatttgga 
gagagctatatttccgagaagaccaaaggggagatttctcagcctggggcggct^^ 
aagaacggctctggggggaacaatcctcaccaagggtccttagacctggagattcgaagcgaagga 
ggaaacatttatgactgttgcattaaagcccaagaaggaactctcgctatcccttgctgl^^ 

20 cccttatggctattttggggactagtaattatagtaggacgcatagcaggctatggattacgtgg^ 
ctcgctgttataataaggatttgtattagaggcttaaatttgatatttgaaataat^^ 
cttgattatattggaagagctttaaatcctggcacatctcatgtatcaatccctcagtatgto 
aaaaacaaggggggaactgtggggtttttatgaggggttttataaatgattataagagtaa 
aagttgctgatgctctcataaccttgtataacccaaaggactagctcatgttgctaggc;^^ 

25 ccgcaataaccgcatttgtgacgcgagttccccattggtgacgcgttaacttcc 
atataagtgcttgtattctgacaattgggcactcagattctgcggtctc^ 
gctgaaaaggcctttctaataaatataattctctactcagtccctgtctctagtttgtctc 
gatcctacagagctcatgccttggcgtaatcatggtcatagctgtttcctgtc^ 
cgctcacaattccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatc 

30 tgagctaactcacattaattgcgttgcgctcactgcccgctttccagtcgggaaacctc^ 
agctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcot 
cctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaagg 
cggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggcca^ 
aaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctc 

35 agcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccagg 
cgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgt 
ccgcctttctcccttcgggaagcgtggcx3ctttctcatagctcacgctgtaggtatc 
tgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgct^ 
tatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagcc^ 

40 ctggtaacaggattagcagagcgaggtatgtagggggtgctact^gagttcttgaa 
actacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagt^^ 
aaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtcgttot 
agcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatct^ 
acgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatca;^^ 

45 cctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagt^ 
ctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgot^ 
tagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccag 
ctgcaatgataccgcgagacccacgctcaccggctccagatttatcagcaataaaccagccag 
gaagggccgagcgcagaagtggtcctgcaactttatccgcotccatcc^^ 

50 gggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgcc^ 
tcgtggtgtcacgctcgtcgtttggtatggcitcattcagctccggttc 
ttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatc^^ 
gtaagttggccgcagtgttatcactcatggttatggcagcactgcataai^ 
catccgtaagatgcttttctgtgactggtgagtactcaaccy^ 

55 ggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcag^ 
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AAGTGCTCATCATTGGAAA2VCGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGAT 
CCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAG^^ 
CTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTT 
GAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGOT 
5 GATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTT 

TGCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTC 
;^,Iupfp^fp;^CCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAG^ 
GTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCA^ 
GCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTAATCAAGTTTTOT 
10 GTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGG 
AAAGCCAACCTGGCTTATCGAAATTAATACGACTCACTATAGGGAGACCGGC 



